Glycosylphosphatidylinositol Specific Phospholipase D Proteins and Uses Thereof



Field of the Invention 

The present invention relates to
glycosylphosphatidylinositol specific phospholipase D
(GPI-PLD) proteins and uses of these proteins, in
particular in the treatment and diagnosis of diabetes and
complications of diabetes such as insulin resistance.

Background of the Invention 

Studies have shown that a number of cell surface proteins 
are attached to the cell membrane by covalent linkage to 
a glycosylphosphatidylinositol (GPI) anchor. It has been 
shown that the enzyme GPI-PLD cleaves the phosphodiester
bond linking glycosylphosphatidylinositol to phosphatidic 
acid, thereby releasing anchored proteins. 

GPI-PLD enzymes are abundantly present in human and 
bovine serum (5-10 µg/ml in human serum). US Patent No:
5,418,147 (Huang et al) describes the purification of 
GPI-PLD from bovine liver, and the subsequent cloning of 
three GPI-PLD enzymes from bovine liver, human liver and 
human pancreas cDNA libraries. This patent reports the 
full length cDNA and amino acid sequences of the GPI-PLDs 
from human and bovine liver, and the partial cDNA and 
amino acid sequences of the human pancreatic form of the 
enzyme. Subsequently, the full length sequence of the 
pancreatic form of GPI-PLD was reported in Tsang et al 
(1992), and this enzyme has been found in cDNA libraries
from breast, eye, spleen and tonsil. The three forms of 
the enzymes are highly homologous with the predicted 
mature protein sequences of bovine liver GPI-PLD sharing 
82% sequence identity with the human liver enzyme and 81% 
sequence identity with the human pancreatic enzyme. The 
amino acid sequences of human liver and pancreatic forms 
of GPI-PLD were deposited at GenBank under accession 
numbers L11701 and L11702 and consist of 841 and 840 
amino acids respectively. The human liver and pancreatic
forms of GPI-PLD share 94.6% sequence identity. The
structure of GPI-PLDs is further discussed in Scallon et
al, 1991.

However, despite cloning three forms of GPI-PLD, there is
no suggestion in these references as to the in vivo role
of the enzymes. Further, the only application of the
enzymes suggested is in an expression system in which a
heterologous protein is expressed in a host cell as a
fusion with a GPI signal peptide, leading to the
heterologous protein becoming anchored to the cell
membrane by a GPI anchor, where it can be cleaved off by
coexpressed or added GPI-PLD.

GPI-PLD has also been isolated from human serum by Hoener
et al (1992) and this form of the enzyme was found to be
identical to the human pancreatic GPI-PLD apart from
changes at residues 531 to 534 where VIGS is replaced by
MLGT. This paper also showed that treatment of serum
GPI-PLD with N-glycosidase F reduced the apparent
molecular weight from 123 kD to 87 kD. Similarly, Li et al
(1994) showed that GPI-PLD was cleaved by trypsin into 3
fragments (2 x 40 kD and 30 kD), and Heller et al
(1994) showed that 33, 39 and 47 kD species were
produced, with only the N-terminal 39 kD fragment
showing enzyme activity after renaturation.

It has been proposed that one function of GPI-PLD enzyme
is to produce inositolphosphoglycans (IPGs) by the
cleavage of "free" GPIs in the plasma membrane in
response to binding of a growth factor to its receptor
(Rademacher et al, 1994). This role for GPI-PLD has been
demonstrated in mast cells where IgE-dependent activation
of these cells results in release of their granule
contents, which include substances such as histamine, a
mediator of the inflammatory response. In the presence
of antigen, histamine is released; this release can be
mimicked by addition of IPGs and is blocked by addition
of anti-GPI-PLD antibodies (Lin et al, 1991).

The role of GPI-PLD in cleaving GPI-anchored proteins,
and especially inositolphosphoglycans (IPGs), is examined
in Jones et al (1997). However, the authors reflect the
uncertainty in the art regarding the mechanism of IPG
generation, noting that "The definitive activated enzyme,
being a GPI-PLC or a GPI-PLD, has yet to be unequivocally
identified" and that "little attention has been payed to
the role of GPI-PLD as the hydrolysing enzyme".

In summary, despite the cloning of GPI-PLD enzymes and
investigation as to their biochemical properties, the
role of the enzyme in vivo or any possible medical use
remains unknown.



Summary of the Invention 

Broadly, the present invention relates to GPI-PLD for
medical use, and in particular to the use of GPI-PLD in
the treatment of diabetes and complications of diabetes,
optionally in combination with existing therapies, such
as the administration of insulin. The GPI-PLD can be the
forms of the enzyme disclosed in the prior art, or the
GPI-PLDs disclosed for the first time here.



Insulin is a major anabolic hormone and has both
mitogenic and metabolic effects. Whilst much effort has
been directed towards study of the cascade of
intracellular phosphorylation events initiated by the
binding of insulin to its cell surface receptor, the
signalling arm mediated by IPGs has been largely
overlooked. The present invention is based on the
realisation that GPI-PLDs are in fact the enzymes
responsible for production of IPG second messengers
following the binding of insulin to its receptor. The
IPGs then interact with other cellular enzymes,
instigating some of the metabolic effects of the hormone.
In particular, diabetic complications such as insulin
resistance may be caused by deficiencies in GPI-PLD.
Pancreatic islet cells produce and secrete GPI-PLD, which
is transported in blood complexed with apolipoprotein AI,
and may therefore represent the major source of
circulating enzyme.

Insulin resistance is seen in the early stages of both
type I (IDDM) and type II (NIDDM) diabetes mellitus. If
GPI-PLD levels are depleted by the destruction of
pancreatic β-cells, as is seen in streptozotocin-treated
rats, then this could be an important factor in the
development of insulin resistance. This in turn suggests
the treatment of such patients with GPI-PLD, optionally
in combination with other diabetes therapies.

Accordingly, in a first aspect, the present invention 
provides GPI-PLD for use in a method of medical 
treatment.

In a further aspect, the present invention provides the 
use of GPI-PLD for the preparation of a medicament for 
the treatment of diabetes, and in particular insulin 
dependent forms of diabetes. 

In a further aspect, the present invention provides the 
use of GPI-PLD for the preparation of a medicament for 
the treatment of complications of diabetes, and in 
particular the treatment of insulin resistance. 

In a further aspect, the present invention provides a 
method of treating a patient having diabetes or 
complications arising from diabetes, the method 
comprising administering to the patient a therapeutically 
effective amount of GPI-PLD. 
In all of the above aspects, GPI-PLD can be administered
alone or in conjunction with other treatments for 
diabetes or diabetic complications, either sequentially 
or simultaneously. 

In a further aspect, the present invention provides a kit 
comprising a composition including GPI-PLD and a second 
composition for the treatment of diabetes. 



In a further aspect, the present invention provides an
expression vector comprising nucleic acid encoding
GPI-PLD for use in a method of gene therapy, e.g. in the
treatment of patients unable to produce sufficient
GPI-PLD. The GPI-PLD encoding nucleic acid can be a
sequence shown in figures 4 to 6 or one of the known
nucleic acid sequences.



In a further aspect, the present invention provides a
cell line for transplantation into a patient, wherein the
cell line is transformed with nucleic acid encoding
GPI-PLD, and is capable of expressing and secreting GPI-PLD.
In one embodiment, the cell line is encapsulated, e.g. in
a biocompatible polymer, so that the GPI-PLD produced by
the cell line can be secreted into the patient, while
preventing rejection by the immune system of the host.
Methods for encapsulating cells in biocompatible polymers
are described in WO93/16687 and WO96/31199.

In a further aspect, the present invention provides a
pharmaceutical composition comprising a nucleic acid
molecule encoding a GPI-PLD protein.

In a further aspect, the present invention provides a 
pharmaceutical composition comprising a GPI-PLD protein. 






In a further aspect, the present invention provides the
use of GPI-PLD levels in the diagnosis of diabetes or
diabetic complications. Thus, the present invention
provides a method of diagnosing diabetes or diabetic
complications, the method comprising determining the
amount of GPI-PLD in a biological sample from a patient.
This determination can help in the diagnosis or prognosis
of the patient, allowing the treatment of the patient to
be tailored to the patient's individual needs.

In one embodiment, the method of diagnosing diabetes or
diabetic complications comprises the steps of:

(a) contacting a biological sample obtained from
the patient with a solid support having immobilised
thereon a binding agent having binding sites specific for
GPI-PLD;

(b) contacting the solid support with one or more
labelled developing agents capable of binding to
unoccupied binding sites, bound GPI-PLD or occupied
binding sites; and,

(c) detecting the label of the developing agents
specifically binding in step (b) to obtain a value
representative of the amount of GPI-PLD in the sample.

Alternatively or additionally, the method can assess
GPI-PLD levels by measuring one of its biological
activities, which are discussed further below.

The present invention also relates to novel GPI-PLD
proteins and nucleic acid molecules, and in particular to
forms of the protein having sequence differences compared
to the known human liver and pancreatic forms reported in
the prior art.

In a further aspect, the present invention provides a
substance which is an isolated polypeptide comprising a
polypeptide having the amino acid sequence set out in
figure 3.



In a further aspect, the present invention provides
isolated nucleic acid molecules encoding any one of the
above polypeptides. Examples of such nucleic acid
sequences are the nucleic acid sequences set out in
figures 4 to 6. The present invention also includes
nucleic acid molecules having greater than 90% sequence
identity with the nucleic acid sequences shown in these
figures.

In further aspects, the present invention provides an
expression vector comprising nucleic acid encoding the
above GPI-PLD proteins operably linked to control
sequences to direct its expression, and host cells
transformed with the vectors. The present invention also
includes a method of producing the above GPI-PLD proteins
comprising culturing the host cells and isolating the
GPI-PLD thus produced.

These and other aspects of the present invention are
described in more detail below.

By way of example, embodiments of the present invention
will now be described in more detail with reference to
the accompanying figures.

Brief Description of the Figures 

Figure 1 shows an alignment of the deduced amino acid
sequences of GPI-PLD encoded by cDNA clone a1 and the
bovine and human liver GPI-PLD sequences disclosed in US
Patent No: 5,418,147 (Huang et al).

Figure 2 shows the nucleic acid sequence from cDNA clone
a1 aligned with the pancreatic forms of GPI-PLD disclosed
in US Patent No: 5,418,147 (Huang et al) (partial
sequence) and the corresponding full length nucleic acid
sequence deposited at GenBank.
Figure 3 shows the amino acid sequences of the GPI-PLDs
in clones a1, b2 and d3, which consist of 840, 795 and 510
amino acids respectively.

Figure 4 shows the nucleic acid sequence of cDNA clone a1
encoding GPI-PLD, consisting of 2832 bp. 

Figure 5 shows the nucleic acid sequence of cDNA clone b2 
encoding GPI-PLD, consisting of 2472 bp. 

Figure 6 shows the nucleic acid sequence of cDNA clone d3 
encoding GPI-PLD, consisting of 1942 bp. 

Figure 7 shows an alignment of the deduced amino acid
sequences of GPI-PLDs encoded by cDNA clones a1, b2 and
d3 with the pancreatic form of the enzyme deposited at
GenBank under accession number L11702.

Figure 8 shows an alignment of the nucleic acid sequences
from cDNA clones a1, b2 and d3 with the cDNA sequence
encoding the human pancreatic form of GPI-PLD deposited
at GenBank under accession number L11702.

Detailed Description 

GPI-PLD Proteins

The term "GPI-PLD biological activity" is herein defined
as the enzymatic activity of GPI-PLD in cleaving the
phosphodiester bond linking glycosylphosphatidylinositol to
phosphatidic acid, e.g. releasing a GPI-anchored protein.
As noted in Heller et al (1994), this activity has been
localised to the N-terminal 39 kD portion of full length
GPI-PLD.

The medical uses of GPI-PLD described herein can use the
novel GPI-PLD variants or the forms of the enzyme
disclosed in the prior art. In either event, the skilled
person can use the techniques described herein and others
well known in the art to produce large amounts of these
proteins, or fragments or active portions thereof, for
use as pharmaceuticals, in the development of drugs and
for further study into their properties and role in vivo.

In a further aspect, the present invention provides a
polypeptide having the amino acid sequence shown in
figure 3, which may be in isolated and/or purified form,
free or substantially free of material with which it is
naturally associated. In one embodiment, clone a1
has an amino acid sequence consisting of 840 amino acids,
comprising a 23 amino acid signal peptide and an 817
amino acid mature protein.






GPI-PLD proteins which are amino acid sequence variants,
alleles or derivatives can also be used in the present
invention. A polypeptide which is a variant, allele or
derivative may have an amino acid sequence which differs
from that given in figures 1 or 3 by one or more of
addition, substitution, deletion and insertion of one or
more amino acids. Preferred polypeptides have GPI-PLD
enzymatic function as defined above.

A GPI-PLD protein which is an amino acid sequence
variant, allele or derivative of an amino acid sequence
shown in figures 1 or 3 may comprise an amino acid
sequence which shares greater than about 70%, greater
than about 80%, greater than about 90%, greater than
about 95%, greater than about 97%, greater than about 98%
or greater than about 99% sequence identity with an amino
acid sequence shown in figures 1 or 3. Sequence
comparison and identity calculations were carried out
using the Clustal program (Thompson et al, 1994), using
the following parameters (Pairwise Alignment Parameters:
Weight Matrix: pam series; Gap Open Penalty: 10.00; Gap
Extension Penalty: 0.10). Alternatively, the GCG program
could be used, which is available from Genetics Computer
Group, Oxford Molecular Group, Madison, Wisconsin, USA,
version 9.1. Particular amino acid sequence variants may
differ from those shown in figures 1 and 3 by insertion,
addition, substitution or deletion of 1 amino acid, 2, 3,
4, 5-10, 10-20, 20-30, 30-50, 50-100, 100-150, or more
than 150 amino acids.
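
To illustrate what a percentage sequence identity means in practice, the following is a minimal Python sketch, not the alignment program referred to above. It assumes the two sequences have already been aligned (gaps written as "-") and it assumes that identity is counted over positions where neither sequence has a gap; alignment programs may use a different denominator. The function name percent_identity is hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two rows of a pairwise alignment.

    Assumption: identity = identical positions / positions where
    neither row contains a gap character.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned positions to compare")
    return 100.0 * matches / compared


# Toy example: the VIGS / MLGT difference reported for residues 531-534.
# Only the glycine is shared, so one of four positions is identical.
print(percent_identity("VIGS", "MLGT"))
```

A variant falling within, say, the "greater than about 90%" class would return a value above 90 when compared in this way against a sequence of figures 1 or 3.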

The present invention also includes the use of active
portions, fragments and derivatives of the GPI-PLD
proteins.

An "active portion" of GPI-PLD protein is a polypeptide
which is less than said full length GPI-PLD protein, but
which retains at least one of its essential biological
activities, e.g. the enzyme activity mentioned above. For
instance, portions of GPI-PLD protein can act as
sequestrators or competitive antagonists by interacting
with other proteins.

A "fragment" of the GPI-PLD protein means a stretch of
amino acid residues of at least about 5 to 7 contiguous
amino acids, often at least about 7 to 9 contiguous amino
acids, typically at least about 9 to 13 contiguous amino
acids and, most preferably, at least about 20 to 30 or
more contiguous amino acids.

A "derivative" of the GPI-PLD protein, or a fragment
thereof, means a polypeptide modified by varying the
amino acid sequence of the GPI-PLD protein, e.g. by
manipulation of the nucleic acid encoding the protein or
by altering the protein itself. Such derivatives of the
natural amino acid sequence may involve insertion,
addition, deletion or substitution of one, two, three,
five or more amino acids, without fundamentally altering
a biological activity of the wild type GPI-PLD protein.

A polypeptide according to the present invention may be
isolated and/or purified (e.g. using an antibody), for
instance after production by expression from encoding
nucleic acid (for which see below). Polypeptides
according to the present invention may also be generated 
wholly or partly by chemical synthesis. The isolated 
and/or purified polypeptide may be used in formulation of 
a composition, which may include at least one additional 
component, for example a pharmaceutical composition 
including a pharmaceutically acceptable excipient, 
vehicle or carrier. A composition including a 
polypeptide according to the invention may be used in 
prophylactic and/or therapeutic treatment as discussed 
below. 

The GPI-PLD polypeptides can also be linked to a coupling 
partner, e.g. an effector molecule, a label, a drug, a 
toxin and/or a carrier or transport molecule. Techniques 
for coupling the peptides of the invention to both 
peptidyl and non-peptidyl coupling partners are well 
known in the art. In one embodiment, the carrier 
molecule is a 16 aa peptide sequence derived from the 
homeodomain of Antennapedia (e.g. as sold under the name
"Penetratin"), which can be coupled to a peptide via a
terminal Cys residue. The "Penetratin" molecule and its
properties are described in WO91/18981.

Pharmaceutical Compositions 

As mentioned above, GPI-PLD proteins can be used for
treating diabetes and the complications of diabetes (e.g.
insulin resistance), optionally in conjunction with other
treatments for these disorders. Thus, the GPI-PLD
proteins can be formulated in pharmaceutical
compositions, which may comprise, in addition to one of
the above substances, a pharmaceutically acceptable
excipient, carrier, buffer, stabiliser or other materials
well known to those skilled in the art. Such materials
should be non-toxic and should not interfere with the
efficacy of the active ingredient. The precise nature of
the carrier or other material may depend on the route of
administration, e.g. oral, intravenous, cutaneous or
subcutaneous, nasal, intramuscular or intraperitoneal
routes.

Pharmaceutical compositions for oral administration may 
be in tablet, capsule, powder or liquid form. A tablet 
may include a solid carrier such as gelatin or an 
adjuvant. Liquid pharmaceutical compositions generally 
include a liquid carrier such as water, petroleum, animal 
or vegetable oils, mineral oil or synthetic oil. 
Physiological saline solution, dextrose or other 
saccharide solution or glycols such as ethylene glycol, 
propylene glycol or polyethylene glycol may be included. 

For intravenous, cutaneous or subcutaneous injection, or 
injection at the site of affliction, the active 
ingredient will be in the form of a parenterally 
acceptable aqueous solution which is pyrogen-free and has
suitable pH, isotonicity and stability. Those of
relevant skill in the art are well able to prepare
suitable solutions using, for example, isotonic vehicles
such as Sodium Chloride Injection, Ringer's Injection or
Lactated Ringer's Injection. Preservatives, stabilisers,
buffers, antioxidants and/or other additives may be 
included as required. 

Whether it is a polypeptide, peptide, nucleic acid
molecule, small molecule or other pharmaceutically useful
compound of the invention that is to be given to an
individual, administration is preferably in a
"prophylactically effective amount" or a "therapeutically
effective amount" (as the case may be, although
prophylaxis may be considered therapy), this being
sufficient to show benefit to the individual. The actual
amount administered, and rate and time-course of
administration, will depend on the nature and severity of
what is being treated. Prescription of treatment, e.g.
decisions on dosage etc, is within the responsibility of
general practitioners and other medical doctors, and
typically takes account of the disorder to be treated,
the condition of the individual patient, the site of
delivery, the method of administration and other factors
known to practitioners. Examples of the techniques and
protocols mentioned above can be found in Remington's
Pharmaceutical Sciences, 16th edition, Osol, A. (ed),
1980.

GPI-PLD proteins can be administered alone or in
combination with other treatments for diabetes or
diabetic complications, either simultaneously or
sequentially. Examples of known diabetes treatments
include (a) insulin, which is typically delivered by
injection, (b) oral insulin compositions, (c) glucose
sparing or insulin enhancing drugs, (d) α-glucosidase
inhibitors to reduce carbohydrate absorption (Precose and
miglitol), and (e) drugs used to treat patients with
insulin sensitivity, e.g. thiazolidinediones such as
Rezulin, rosiglitazone and pioglitazone, and tyrosine
phosphatase inhibitors.

In further embodiments, the GPI-PLD can be administered
with P and/or A-type IPGs, and/or antagonists of these
substances. Methods for obtaining A-type and P-type IPGs
and their antagonists are set out in Caro et al, 1997,
and in WO98/11116 and WO98/11117.

The role of P and A-type IPGs and their use in the
diagnosis and treatment of diabetes is disclosed in
WO98/11435. In summary, this application discloses that
in some forms of diabetes the ratio of P:A type IPGs is
imbalanced and can be corrected by administering a
medicament comprising the appropriate ratio of P or
A-type IPGs or antagonist thereof. In particular,
WO98/11435 describes the treatment of obese type II
diabetes (NIDDM) patients with a P-type IPG or with an
A-type IPG antagonist, such as antibodies which bind
specifically to the A-type IPG, and the treatment of IDDM
or lean type II diabetes (NIDDM) (body mass index < 27)
with a mixture of A and P-type IPGs, typically in a P:A
ratio of about 6:1 for males and 4:1 for females.

The compositions of the invention can be used in the 
treatment of diabetes, in particular insulin dependent 
forms of diabetes (type I and type II diabetes) . They 
can also be used in the treatment of the complications of 
diabetes and in particular forms of insulin resistance 
such as insulin resistance in type I or type II diabetes 
and brittle diabetes. The compositions may also be used 
to treat other conditions mediated by insulin, and in 
particular insulin underproduction such as neurotrophic
disorders or polycystic ovary disease. 



GPI-PLD Nucleic Acid

"GPI-PLD nucleic acid" includes a nucleic acid molecule
which has a nucleotide sequence encoding a polypeptide
which includes the amino acid sequence shown in figures 4
to 6, and in some embodiments of the invention extends to
the known human liver and pancreatic forms of GPI-PLD
(L11701 and L11702). These forms of GPI-PLD have been
mapped to human chromosome 6 and are contained in the 4
centimorgan region of D6S1660-D6S1558 at positions 95.95
and 99.71 (NCBI GeneMap'98). This corresponds to the
cytogenetic region of 6p21.3. This region also contains
the IDDM1 and HLA loci (although the HLA genes map to the
adjacent D6S1558-D6S1616 interval). The mouse GPI-PLD
gene has also been mapped to chromosome 13, near the fim
1 locus, which is found in humans on chromosome 6.



The GPI-PLD coding sequence may be that shown in figures
2, 4 to 6 or 8, a complementary nucleic acid sequence, or
it may be a mutant, variant, derivative or allele of 
these sequences. The sequence may differ from that shown 
by a change which is one or more of addition, insertion, 
deletion and substitution of one or more nucleotides of 
the sequence shown. Changes to a nucleotide sequence may 
result in an amino acid change at the protein level, or 
not, as determined by the genetic code. 
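
As an illustration of how the genetic code determines whether a nucleotide change alters the encoded amino acid, the following Python sketch builds the standard genetic code table and classifies a single-codon substitution. The function names translate and classify_substitution are hypothetical, and the example codons are chosen for illustration only; they are not taken from the sequences shown in the figures.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def classify_substitution(codon: str, mutated: str) -> str:
    """Report whether a codon change is silent or alters the amino acid."""
    before = CODON_TABLE[codon.upper()]
    after = CODON_TABLE[mutated.upper()]
    return "silent" if before == after else f"{before}->{after}"


print(classify_substitution("GTG", "GTC"))  # both codons encode valine
print(classify_substitution("GTG", "ATG"))  # valine becomes methionine
```

The first change leaves the protein unchanged, while the second produces an amino acid substitution of the kind discussed for polypeptide variants.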

The encoded polypeptide may comprise an amino acid 
sequence which differs by one or more amino acid residues 
from the amino acid sequence shown in the figures. 
Nucleic acid encoding a polypeptide which is an amino 
acid sequence mutant, variant, derivative or allele of 
the sequence shown in figures 1, 3 or 7 is further 
provided by the present invention. Such polypeptides are 
discussed below. Nucleic acid encoding such a 
polypeptide may show greater than about 70% identity, 
greater than about 80% identity, greater than about 90%
identity, greater than about 95% identity, greater than 
about 98% identity, or greater than about 99% identity 
with a sequence shown in the figures. 

The present invention also includes fragments of the GPI- 
PLD nucleic acid sequences described herein, the 
fragments preferably being at least 12, 15, 30, 45, 60, 
or 120 nucleotides in length. 

Generally, nucleic acid according to the present 
invention is provided as an isolate, in isolated and/or 
purified form, or free or substantially free of material 
with which it is naturally associated, such as free or 
substantially free of nucleic acid flanking the gene in 
the human genome, except possibly one or more regulatory 
sequence(s) for expression. Nucleic acid may be wholly
or partially synthetic and may include genomic DNA, cDNA
or RNA. Where nucleic acid according to the invention
includes RNA, reference to the sequence shown should be
construed as reference to the RNA equivalent, with U
substituted for T.
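
The RNA equivalent of a DNA sequence can be derived mechanically. The short Python sketch below shows the substitution; the function name rna_equivalent and the example sequence are hypothetical.

```python
def rna_equivalent(dna: str) -> str:
    """Return the RNA equivalent of a DNA sequence (U substituted for T)."""
    return dna.upper().replace("T", "U")


print(rna_equivalent("ATGGTT"))
```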

Nucleic acid sequences encoding all or part of the GPI- 
PLD gene and/or its regulatory elements can be readily 
prepared by the skilled person using the information and 
references contained herein and techniques known in the 
art (for example, see Sambrook, Fritsch and Maniatis,
Molecular Cloning, A Laboratory Manual, Cold Spring
Harbor Laboratory Press, 1989, and Ausubel et al, Short
Protocols in Molecular Biology, John Wiley and Sons,
1992). These techniques include (i) the use of the
polymerase chain reaction (PCR) to amplify samples of
such nucleic acid, e.g. from genomic sources, (ii)
chemical synthesis, or (iii) amplification in E. coli.
Modifications to the GPI-PLD sequences can be made, e.g. 
using site directed mutagenesis, to provide expression of 
modified GPI-PLD protein or to take account of codon 
preference in the host cells used to express the nucleic 
acid. 

In order to obtain expression of the GPI-PLD nucleic acid
sequences, the sequences can be incorporated in a vector 
having control sequences operably linked to the GPI-PLD 
nucleic acid to control its expression. The use of 
expression systems has reached an advanced degree of 
sophistication. The vectors may include other sequences 
such as promoters or enhancers to drive the expression of 
the inserted nucleic acid, nucleic acid sequences so that 
the GPI-PLD protein is produced as a fusion and/or 
nucleic acid encoding secretion signals so that the 
polypeptide produced in the host cell is secreted from 
the cell. GPI-PLD protein can then be obtained by
transforming the vectors into host cells in which the
vector is functional, culturing the host cells so that
the GPI-PLD protein is produced and recovering the
GPI-PLD protein from the host cells or the surrounding
medium. Prokaryotic and eukaryotic cells are used for
this purpose in the art, including strains of E. coli,
yeast, and eukaryotic cells such as COS or CHO cells.
The choice of host cell can be used to control the
properties of the GPI-PLD protein expressed in those
cells, e.g. controlling where the polypeptide is
deposited in the host cells or affecting properties such
as its glycosylation and phosphorylation.

PCR techniques for the amplification of nucleic acid are
described in US Patent No: 4,683,195. In general, such
techniques require that sequence information from the
ends of the target sequence is known to allow suitable
forward and reverse oligonucleotide primers to be
designed to be identical or similar to the polynucleotide
sequence that is the target for the amplification. PCR
comprises steps of denaturation of template nucleic acid
(if double-stranded), annealing of primer to target, and
polymerisation. The nucleic acid probed or used as
template in the amplification reaction may be genomic
DNA, cDNA or RNA. PCR can be used to amplify specific
sequences from genomic DNA, specific RNA sequences and
cDNA transcribed from mRNA, bacteriophage or plasmid
sequences. The GPI-PLD nucleic acid sequences
provided herein readily allow the skilled person to
design PCR primers. References for the general use of
PCR techniques include Mullis et al, Cold Spring Harbor
Symp. Quant. Biol., 51:263, 1987; Ehrlich (ed), PCR
Technology, Stockton Press, NY, 1989; Ehrlich et al,
Science, 252:1643-1650, 1991; "PCR protocols; A Guide to
Methods and Applications", Eds. Innis et al, Academic
Press, New York, 1990.
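
As a rough illustration of how sequence information from the ends of a target is used, the following Python sketch derives a forward primer identical to the 5' end of the target strand and a reverse primer that is the reverse complement of its 3' end. This is a simplified sketch under stated assumptions: the function names, the default primer length and the example target are hypothetical, and practical primer design would also consider melting temperature, GC content and secondary structure.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_primers(target: str, length: int = 20) -> tuple[str, str]:
    """Derive forward and reverse primers from the ends of a target.

    Assumption: `target` is the sense strand written 5' to 3', and
    `length` is an illustrative primer length.
    """
    target = target.upper()
    if len(target) < 2 * length:
        raise ValueError("target is too short for two non-overlapping primers")
    forward = target[:length]                       # matches the 5' end
    reverse = reverse_complement(target[-length:])  # anneals to the 3' end
    return forward, reverse


# Made-up 21-base target, used only to show the mechanics.
fwd, rev = design_primers("ATGGCCAAGTTTCCCGGGTAA", length=6)
print(fwd, rev)
```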






Nucleic acid according to the present invention is
obtainable using one or more oligonucleotide probes or
primers designed to hybridize with one or more fragments
of the nucleic acid sequence shown in the figures,
particularly fragments of relatively rare sequence, based
on codon usage or statistical analysis. A primer
designed to hybridize with a fragment of the nucleic acid
sequence shown in the above figures may be used in
conjunction with one or more oligonucleotides designed to
hybridize to a sequence in a cloning vector within which
target nucleic acid has been cloned, or in so-called
"RACE" (rapid amplification of cDNA ends) in which cDNAs
in a library are ligated to an oligonucleotide linker and
PCR is performed using a primer which hybridizes with a
GPI-PLD nucleic acid sequence shown in the figures and a
primer which hybridizes to the oligonucleotide linker.

Such oligonucleotide probes or primers, as well as the 
full-length sequence (and mutants, alleles, variants and 
derivatives) are also useful in screening a test sample 
containing nucleic acid for the presence of alleles, 
mutants and variants, especially those that lead to the 
production of inactive forms of GPI-PLD protein,
the probes hybridizing with a target sequence from a
sample obtained from the individual being tested. The
conditions of the hybridization can be controlled to
minimise non-specific binding, and stringent
to moderately stringent hybridization conditions are
preferred. The skilled person is readily able to design
such probes, label them and devise suitable conditions 
for the hybridization reactions, assisted by textbooks 
such as Sambrook et al (1989) and Ausubel et al (1992) . 

Examples of "stringent conditions" are those which: (1) 
employ low ionic strength and high temperature for 
washing, for example 0.015 M sodium chloride/0.0015 M 
sodium citrate/0.1% sodium dodecyl sulphate at 50°C; (2) 
employ during hybridisation a denaturing agent, such as 
formamide, for example, 50% (v/v) formamide with 0.1% 
BSA/0.1% Ficoll/0.1% polyvinylpyrrolidone/50mM sodium 
phosphate buffer at pH 6.5 with 750mM sodium chloride, 
75mM sodium citrate at 42°C; or (3) employ 50% formamide, 
5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50mM 
sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x 
Denhardt's solution, sonicated salmon sperm DNA 
(50 µg/ml), 0.1% SDS, and 10% dextran sulphate at 42°C, 
with washes at 42°C in 0.2 x SSC and 50% formamide at 
55°C, followed by a high stringency wash consisting of 0.1 
x SSC containing EDTA at 55°C. These hybridisation 
conditions may be used in the context of defining nucleic 
acid sequences which hybridize with GPI-PLD nucleic acid 
sequences. 
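
The salt concentrations quoted for the different washes are all 
multiples of one SSC stock. The following is a minimal sketch, not 
part of the original protocol, which derives the 1 x SSC 
composition from the 5 x SSC figures given in condition (3) and 
scales it to the other strengths mentioned. The function name 
ssc_molarity and the rounding to six decimal places are 
illustrative assumptions. 

```python
# Molar composition of 1 x SSC, derived from the 5 x SSC figures quoted
# in condition (3): 0.75 M NaCl and 0.075 M sodium citrate.
SSC_1X = {"NaCl": 0.75 / 5, "sodium citrate": 0.075 / 5}


def ssc_molarity(fold):
    """Return the molar concentration of each SSC component at `fold` x SSC.

    Hypothetical helper for illustration; values are rounded to six
    decimal places to suppress floating-point noise.
    """
    return {name: round(conc * fold, 6) for name, conc in SSC_1X.items()}


if __name__ == "__main__":
    # Strengths mentioned in the stringency conditions.
    for fold in (5, 0.2, 0.1):
        print(f"{fold} x SSC: {ssc_molarity(fold)}")
```

On this reading, the wash in condition (1) (0.015 M sodium 
chloride/0.0015 M sodium citrate) has the same salt content as 
the 0.1 x SSC high stringency wash in condition (3). 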

Uses of GPI-PLD Nucleic Acid 

The GPI-PLD nucleic acid sequences can be used in the 
preparation of cell lines capable of expressing GPI-PLD 
and in gene therapy techniques. 



Thus, the present invention provides a cell line for 
transplantation into a patient, the cell line being 
transformed with nucleic acid encoding GPI-PLD, and being 
capable of expressing and secreting GPI-PLD. In one 
embodiment, the cell lines are encapsulated, e.g. in a 
biocompatible polymer, so that the GPI-PLD produced by 
the cell line can be secreted into the patient, while 
preventing rejection by the immune system of the host. 
Methods for encapsulating cells in biocompatible polymers 
are described in WO93/16687 and WO96/31199. 



As a further alternative, the nucleic acid encoding the 
GPI-PLD protein could be used in a method of gene 
therapy, to treat a patient who is unable to synthesize 
the active polypeptide or unable to synthesize it at the 
normal level, thereby providing the effect provided by 
wild-type GPI-PLD protein and suppressing the occurrence 
of diabetes in the target cells. 

Vectors such as viral vectors have been used in the prior 
art to introduce genes into a wide variety of different 
target cells. Typically, the vectors are exposed to the 
target cells so that transfection can take place in a 
sufficient proportion of the cells to provide a useful 
therapeutic or prophylactic effect from the expression of 
the desired polypeptide. The transfected nucleic acid 
may be permanently incorporated into the genome of each 
of the targeted cells, providing long lasting 
effect, or alternatively the treatment may have to be 
repeated periodically. 

A variety of vectors, both viral vectors and plasmid 
vectors, are known in the art, see US Patent No: 
5,252,479 and WO93/07282. In particular, a number of 
viruses have been used as gene transfer vectors, 
including papovaviruses, such as SV40, vaccinia virus, 
herpesviruses, including HSV and EBV, and retroviruses. 
Many gene therapy protocols in the prior art have used 
disabled murine retroviruses. 

As an alternative to the use of viral vectors, other known 
methods of introducing nucleic acid into cells include 
electroporation, calcium phosphate co-precipitation, 
mechanical techniques such as microinjection, transfer 
mediated by liposomes, direct DNA uptake and 
receptor-mediated DNA transfer. 

As mentioned above, the aim of gene therapy using nucleic 
acid encoding the GPI-PLD protein, or an active portion 
thereof, is to increase the amount of the expression 
product of the nucleic acid in cells in which the 
wild-type GPI-PLD protein is absent or present 
only at reduced levels. Target cells for gene therapy 
include insulin secreting β-cells or any neuron derived 
cells. Cell engineering can be used to provide the 
overexpression or repression of GPI-PLD protein in 
transfected cell lines which can then be subsequently 
transplanted to humans. Gene therapy can be employed 
using a promoter to drive GPI-PLD protein expression in a 
tissue specific manner (e.g. an insulin promoter linked 
to GPI-PLD cDNA will overexpress GPI-PLD protein in 
β-cells and transiently in the brain). If defective 
function of GPI-PLD protein is involved in neurological 
disease, GPI-PLD protein can be overexpressed in 
transformed cell lines for transplantation. 

Gene transfer techniques which selectively target the 
GPI-PLD nucleic acid to target tissues are preferred. 
Examples of this include receptor-mediated gene 
transfer, in which the nucleic acid is linked to a 
protein ligand via polylysine, with the ligand being 
specific for a receptor present on the surface of the 
target cells. 

Diagnostic Methods 

Methods for determining the concentration of analytes in 
biological samples from individuals are well known in the 
art and can be employed in the context of the present 
invention to determine the presence or amount of GPI-PLD 
in a biological sample from a patient. This in turn can 
allow a physician to determine whether a patient suffers 
from diabetes or diabetic complications, and so optimise 
the treatment of it. 

Broadly, the methods divide into those screening for the 
presence of GPI-PLD nucleic acid sequences and 
those that rely on detecting the presence or absence of 
the GPI-PLD polypeptide. The methods make use of 
biological samples from individuals that are suspected of 
containing the nucleic acid sequences or polypeptide. 

These diagnostic methods can employ biological samples 
such as blood, serum, tissue samples or urine. In view 
of the fact that the activity of GPI-PLD is thought to be 
due to the level of the enzyme circulating in serum, the 
use of serum or blood samples is preferred. 

The assay methods for determining the amount or 
concentration of GPI-PLD protein typically either employ 
binding agents having binding sites capable of 
specifically binding to GPI-PLD in preference to other 
molecules or measure a characteristic biological activity 
of GPI-PLD. Examples of binding agents include 
antibodies, receptors and other molecules capable of 
specifically binding the enzyme. Conveniently, the 
binding agent(s) are immobilised on a solid support, e.g. 
at defined locations, to make them easy to manipulate 
during the assay. 

The sample is generally contacted with the binding 
agent(s) under appropriate conditions so that GPI-PLD 
present in the sample can bind to the binding agent(s). 
The fractional occupancy of the binding sites of the 
binding agent(s) can then be determined using a 
developing agent or agents. Typically, the developing 
agents are labelled (e.g. with radioactive, fluorescent 
or enzyme labels) so that they can be detected using 
techniques well known in the art. Thus, radioactive 
labels can be detected using a scintillation counter or 
other radiation counting device, fluorescent labels using 
a laser and confocal microscope, and enzyme labels by the 
action of an enzyme label on a substrate, typically to 
produce a colour change. The developing agent(s) can be 
used in a competitive method in which the developing 
agent competes with the analyte for occupied binding 
sites of the binding agent, or a non-competitive method, in 
which the labelled developing agent binds analyte bound 
by the binding agent or to occupied binding sites. Both 
methods provide an indication of the number of the 
binding sites occupied by the analyte, and hence the 
concentration of the analyte in the sample, e.g. by 
comparison with standards obtained using samples 
containing known concentrations of the analyte. 
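
The comparison with standards described above can be sketched as 
a simple interpolation on a standard curve. This is an 
illustration only, assuming a non-competitive format in which 
signal rises with concentration; the function name 
concentration_from_signal and the standard values are 
hypothetical and do not come from the experiments reported here. 

```python
from bisect import bisect_left


def concentration_from_signal(signal, standards):
    """Linearly interpolate analyte concentration from a standard curve.

    `standards` is a list of (concentration, signal) pairs measured on
    samples of known concentration. Signal is assumed to rise with
    concentration (non-competitive assay format).
    """
    pts = sorted(standards, key=lambda p: p[1])
    signals = [s for _, s in pts]
    if not signals[0] <= signal <= signals[-1]:
        raise ValueError("signal outside the range of the standards")
    i = bisect_left(signals, signal)
    if signals[i] == signal:
        return pts[i][0]
    (c0, s0), (c1, s1) = pts[i - 1], pts[i]
    return c0 + (c1 - c0) * (signal - s0) / (s1 - s0)


# Hypothetical standards: (GPI-PLD in ug/ml, absorbance reading).
standards = [(0.0, 0.05), (2.5, 0.30), (5.0, 0.55), (10.0, 1.05)]

if __name__ == "__main__":
    print(concentration_from_signal(0.80, standards))
```

In practice a fitted curve (for example a four-parameter logistic) 
may describe the standards better than straight-line segments; 
linear interpolation is used here only to keep the sketch short. 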

Experimental 

The present invention is based on the realisation that 
GPI-PLD is responsible for the production of IPG second 
messengers following binding of insulin to its receptor. 
The IPGs then interact with other cellular enzymes, 
instigating some of the metabolic effects of the hormone. 
In view of this, insulin resistance may be caused by 
deficiencies in GPI-PLD; it has been shown that pancreatic 
islet cells produce and secrete GPI-PLD, which is 
transported in blood complexed with apolipoprotein A1, 
and may therefore represent the major source of 
circulating enzyme. If this is indeed the case then the 
insulin resistance seen in early type I diabetes mellitus 
(IDDM) may result from decreased circulating GPI-PLD 
levels. This may have direct therapeutic relevance in 
that co-infusion of insulin with GPI-PLD may in fact be a 
far more effective therapy for diabetic patients than 
insulin alone. 

Screening of human liver cDNA library 

A human liver cDNA library (Gibco BRL, cat # 10422-012, 
lot # HF4703) was screened for GPI-PLD, resulting in the 
isolation of 3 cDNA clones. The nucleic acid sequences 
of the clones are shown in figures 4 to 6, with the 
deduced amino acid sequences shown in figure 3. 

Clone a1 represents the full length cDNA. There are only 
two differences within the coding region of this sequence 
when compared to that of the human GPI-PLD pancreatic 
form described in the GenBank database (accession number 
L11702). These are a g to a conversion at position 88 
(L11702), 199 (a1) and a t to g conversion at position 
797 (L11702), 908 (a1). Interestingly, this 
latter conversion creates a unique HindIII restriction 
site in the a1 clone. Both conversions result in amino 
acid differences: the first changes amino acid 30 from a 
valine in L11702 to an isoleucine in a1, and the second 
changes amino acid 266 from an isoleucine in L11702 to a 
serine in a1. Clone a1 also differs from L11702 in that 
it contains a 5' untranslated region (UTR) and only shares 
the first 168 bases of the 3' UTR before terminating in a 
poly-A tail. 
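
The second conversion can be checked directly against the 
aligned sequences. The sketch below is illustrative only: the 
two 15-base fragments are copied from Figure 2 (L11702 positions 
790-804 and the corresponding clone a1 positions 901-915), 
AAGCTT is the HindIII recognition sequence, and the helper 
codon_at and the two-entry codon table are hypothetical 
conveniences rather than part of the original analysis. 

```python
# Fragments copied from Figure 2; both start at L11702 position 790,
# which is coding-sequence position 790 because the ATG begins at 1.
L11702_FRAGMENT = "CTAACAATCTTCATG"
A1_FRAGMENT = "CTAACAAGCTTCATG"
FRAGMENT_START = 790

HINDIII_SITE = "AAGCTT"

# Only the two codons needed for this check.
CODONS = {"ATC": "Ile", "AGC": "Ser"}


def codon_at(fragment, first_base, codon_number):
    """Return codon `codon_number` (ATG = codon 1) from a fragment whose
    first base has coding-sequence position `first_base`."""
    start = 3 * (codon_number - 1) + 1 - first_base
    return fragment[start:start + 3]


if __name__ == "__main__":
    for name, frag in (("L11702", L11702_FRAGMENT), ("a1", A1_FRAGMENT)):
        codon = codon_at(frag, FRAGMENT_START, 266)
        print(name, "HindIII site:", HINDIII_SITE in frag,
              "codon 266:", codon, CODONS[codon])
```

The check agrees with the text: the t to g change falls in codon 
266, converting an isoleucine codon to a serine codon, and the 
HindIII recognition sequence is present only in the a1 fragment. 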

Clone b2 lacks the exon of GPI-PLD that begins at 
position 2469 in the a1 nucleotide sequence. However, 
the sequence from here to the end of b2 (2444-2473) does 
not contain a stop codon. It is therefore not clear 
whether b2 represents a cDNA with a different final exon 
or is the product of aberrant processing. 

Clone d3 shares the coding and 3' UTR sequence of the a1 
clone from a1 position 1119 onwards; however, the initial 
1008 base pairs of coding sequence are absent from this 
clone. Clone d3 contains a methionine initiation codon 
in frame with the coding sequence at position 202 and a 
unique 5' UTR. Translation of d3 from this codon would 
result in a unique sequence of 6 amino acids (1-6). 
Clone d3 therefore appears to represent a true 
transcript, in that it contains initiation and stop 
codons and both 5' and 3' UTRs. The predicted protein 
product of this transcript would apparently lack the 
catalytic domain, which has been localised to the 
N-terminus of the GPI-PLD enzyme (amino acids 1-375); 
however, the 3 EF hand-like domains would still be 
present. 

Huang et al and Tsang et al (1992) reported that two 
variants or isoenzymes of GPI-PLD exist, the so-called 
liver and pancreatic forms (accession numbers L11701 and 
L11702). Other workers have detected L11702 cDNAs in 
human breast, eye, spleen, tonsil, and pancreas, as well 
as in liver. However, we failed to detect the liver form 
of GPI-PLD in the liver or in any other tissues. 

Gene mapping and localisation 

The chromosomal gene isolated in the experiments above is 
about 20-30 kb in length. The gene was also isolated on 
a PAC and mapped by fluorescence in situ hybridisation 
(FISH) to 6p21.3, agreeing with recent radiation hybrid 
maps (as seen on GeneMap'98; NCBI). The IDDM1 
susceptibility gene also maps to 6p21.3, although recent 
evidence suggests that at least two closely-linked loci 
for IDDM1 are in the MHC region. The MHC locus itself 
seems to map to a region adjoining the GPI-PLD locus 
rather than within the same microsatellite band, so the 
significance of the proximity of the GPI-PLD and IDDM1 
loci is unclear. 

Northern blots of the mRNA species found in liver have 
shown two presumed splice variants as well as the 
full-length transcript. One has a deletion of about 160 amino 
acids from the mature 817 amino acid protein. The other 
seems to be a C-terminal deletion, which may therefore be 
non-functional if other authors are correct in finding 
that the C-terminus is necessary for enzyme activity. 

The predominant GPI-PLD species detected after tissue 
extraction by antibodies (Western blots) has an apparent 
molecular weight of about 47 kD, which agrees with the 
finding of other authors that full-length GPI-PLD is taken 
up from the plasma and processed to smaller active species. 

GPI-PLD obtained from serum by cells is required for 
second messenger signalling 

The principal goal of these experiments was to determine 
the role of glycosylphosphatidylinositol phospholipase D 
(GPI-PLD) in a type I hypersensitivity reaction. This 
reaction involves the cross-linking of IgE receptors on 
the mast cell surface, leading to the release of allergic 
mediators. 

Such an allergic reaction has been experimentally 
reproduced in our laboratory, using a rat basophilic 
leukaemia cell line, RBL-2H3. These cells naturally have 
unoccupied IgE receptors (FcεRI, or high-affinity 
receptors), allowing them to be passively sensitised with 
an IgE isotype of choice. 

RBL-2H3 cell culture was maintained in Eagle's minimum 
essential medium, containing 10% Foetal Bovine Serum 
(FBS) (heat activated), 100 U/ml Penicillin, 100 µg/ml 
Streptomycin and 2 mM L-glutamine. 

Previous research indicates that RBL-2H3 cells derive 
their GPI-PLD from the culture serum (data not shown). 
Therefore, it follows that inactivation of this external 
source of GPI-PLD would deprive the cells of any further 
enzyme. 

Inactivation of GPI-PLD activity in foetal bovine serum 
was achieved according to the method of Kung et al 
(Biochimica et Biophysica Acta, 1357:329-338, 1997). 
Briefly, FBS was adjusted to pH 11 using concentrated 
hydrochloric acid, and incubated for 1 hour at 37°C. 
After this time, the pH was adjusted to 7.4, and 
GPI-PLD activity was determined using an enzymatic assay 
(Davitz et al, J. Biol. Chem., 264:13760-13764, 1989). 
Results indicated that this alkaline incubation severely 
depleted GPI-PLD activity (data not shown). 

To determine the effect of culture of RBL-2H3 cells in 
GPI-PLD inactive serum, the supplemented MEM was replaced 
with MEM in which the FBS had been inactivated. Although 
the cell appearance was not dramatically altered by the 
changed culture conditions, determination of GPI-PLD 
activity showed a dramatic reduction in activity. 

GPI-PLD activity in cells cultured with GPI-PLD 
active/inactive FBS: 

Active = 0.66 units GPI-PLD activity/mg of protein. 

Inactive = 0.11 units GPI-PLD activity/mg of protein. 

The effect of a reduced GPI-PLD activity on the cell's 
ability to respond to IgE cross-linking was determined as 
follows: 

RBL-2H3 cells were grown to confluence, after which time 
the adherent cells were removed from the culture flask 
using a cell scraper. The cell density was determined 
using a haemocytometer, and adjusted to 2 x 10⁵ per ml. 
The cells were seeded at 1 ml per well in a 24 well 
culture plate and cultured overnight at 37°C in a 
humidified 5% CO₂ incubator. 

The overnight culture media was aspirated and replaced 
with fresh media containing Rat IgE anti-DNP 3 pg/ml. 
After a 2 hour incubation period, the media was 
aspirated, and the cells were washed twice with HEPES 
Buffered Saline. Cross-linking was achieved by the 
addition of 200 µl of DNP-Albumin at 100 ng/ml, and 
incubation for 2 hours. Mediator release was determined 
using a colorimetric assay to detect the presence of 
β-hexosaminidase and compared with the total cell 
β-hexosaminidase content (as determined by incubation with 
200 µl 5% Triton X-100 detergent) (Yasuda et al, Int. 
Immunol., 7:251-258, 1995). As shown in the table below, 
the responsiveness to cross-linking was significantly 
reduced in those cells that were cultured in GPI-PLD 
inactive media. 

Percentage release in IgE cross-linking activity assay 
(compared with total) 

Active GPI-PLD culture = 48.79% 

Inactive GPI-PLD culture = 5.07% 
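
The size of the two effects can be compared with a short 
calculation. This is a sketch for illustration: the activity and 
release figures are those reported above, whereas the function 
percent_release and the absorbance readings passed to it are 
hypothetical and only show how a percentage of total 
β-hexosaminidase would be derived. 

```python
def percent_release(released, total):
    """Released beta-hexosaminidase signal as a percentage of total content."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * released / total


# Values reported in the text.
activity = {"active": 0.66, "inactive": 0.11}  # units GPI-PLD / mg protein
release = {"active": 48.79, "inactive": 5.07}  # % of total content

activity_fold = activity["active"] / activity["inactive"]
release_fold = release["active"] / release["inactive"]

if __name__ == "__main__":
    print(f"GPI-PLD activity ratio (active/inactive): {activity_fold:.1f}")
    print(f"Mediator release ratio (active/inactive): {release_fold:.1f}")
    # Hypothetical absorbance readings, for illustration only.
    print(f"{percent_release(0.244, 0.500):.1f}% release")
```

On these figures the cellular GPI-PLD activity falls about 
six-fold and the mediator release falls between nine- and 
ten-fold when the serum GPI-PLD is inactivated. 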

Phosphorylation of GPI-PLD 

The phosphorylation state of the GPI-PLD enzymes can be 
determined using MALDI-TOF mass spectrometry as described 
by Yip & Hutchins (1992). Spectra of tryptic digests 
of the four proteins can be compared before and after 
treatment with calf intestinal alkaline phosphatase. The 
specific kinases responsible for phosphorylation of 
GPI-PLD can then be determined by incubation of the GPI-PLD 
tryptic fragments with ATP in the presence of various 
kinases. Motif analysis of the amino acid sequence of 
human GPI-PLD using the HGMP "motif" package has revealed 
the presence of numerous potential phosphorylation sites 
for two enzymes: protein kinase C and protein kinase CK2 
(formerly known as casein kinase II). These enzymes may 
therefore be involved in the activation of GPI-PLD. 
Intriguingly, the activity of protein kinase CK2 has been 
shown to be modulated by IPGs (Alemany et al, 1990) and 
there is also indirect evidence suggesting that IPGs may 
act through protein kinase C, thus suggesting the 
possibility of feedback loops regulating the production 
of IPGs. 
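
A motif scan of this kind can be sketched as follows. The 
patterns used are the PROSITE-style consensus sites for protein 
kinase C ([ST]-x-[RK]) and CK2 ([ST]-x(2)-[DE]); that the HGMP 
"motif" package applied exactly these patterns is an assumption, 
and the function find_sites is hypothetical. The test sequence 
is the first 59 residues of the clone a1 protein as printed in 
Figure 1, with the alignment gap removed. 

```python
import re

# PROSITE-style consensus patterns, written as zero-width lookaheads so
# that overlapping sites are all reported. Assumed, not taken from the
# HGMP package itself.
PATTERNS = {
    "protein kinase C": r"(?=[ST].[RK])",  # [ST]-x-[RK]
    "protein kinase CK2": r"(?=[ST]..[DE])",  # [ST]-x(2)-[DE]
}


def find_sites(sequence, pattern):
    """Return 1-based positions of candidate phospho-acceptor residues."""
    return [m.start() + 1 for m in re.finditer(pattern, sequence)]


# N-terminal residues of the clone a1 protein from Figure 1 (gap removed).
A1_N_TERMINUS = "MSAFRLWPGLLIMLGSLCHRGSPCGLSTHIEIGHRALEFLQLHNGRVNYRELLLEHQDA"

if __name__ == "__main__":
    for kinase, pattern in PATTERNS.items():
        print(kinase, find_sites(A1_N_TERMINUS, pattern))
```

Such short consensus patterns match frequently by chance, so the 
positions returned are candidates only and would need to be 
confirmed by the mass spectrometry and kinase incubation 
experiments described above. 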

GPI-PLD as a metal ion transferase 

Two families of IPGs exist. IPGs of the P-type stimulate 
incorporation of glucose into glycogen whereas the A-type 
IPGs stimulate incorporation of glucose into lipid. 
Metal ion analysis has shown that the P-type IPGs contain 
manganese and the A-type zinc. It is known that the 
serum form of GPI-PLD contains approximately 10 atoms of 
zinc per molecule. Investigation can therefore show 
whether the different isoforms of human GPI-PLD produce 
IPGs with differing metal ion content. 




This experiment can be performed in two ways. Firstly, 
purified A-type and P-type IPGs can be extracted from rat 
liver (Caro et al, 1997) and their metal ions removed 
using dithizone in chloroform. The IPGs can be 
incubated in the presence of radioisotopes of zinc (⁶⁵Zn²⁺) 
and manganese (⁵²Mn²⁺) respectively. The radiolabelled 
IPGs can then be added to the different isoforms of 
purified GPI-PLD (as determined in the above experiments) 
in the absence of GPI substrate, thus driving the reaction 
from product (IPG) to substrate (GPI). It can then be 
determined whether or not the GPI-PLD proteins have 
incorporated radioactive metal ions from the IPGs. The 
reverse situation will also be examined, whereby the 
metal ions of GPI-PLD isoforms are replaced by the 
respective radioisotopes. GPI-PLD can then be incubated 
with GPIs extracted from membrane preparations and the 
resulting IPG products analysed for incorporation of 
radioisotope. These experiments will thus determine 
whether or not GPI-PLD is responsible for the transfer of 
divalent cations (Mn²⁺ or Zn²⁺) to its IPG products. 

The significance of co-secretion of GPI-PLD and insulin 
from islet cells 

It has been reported that the pancreatic islet β 
cell-derived mouse cell line βTC3 co-secretes insulin and 
GPI-PLD. Whilst the kinetics of secretion were somewhat 
different, both proteins were co-localised in secretory 
granules and their release was stimulated by glucose and 
a variety of secretagogues (Deeg & Verchere, 1997). The 
relevance of these observations to the present invention 
can be investigated using streptozotocin-induced diabetic 
rats. Streptozotocin results in destruction of the islets 
of Langerhans and thus these animals cannot produce 
insulin and represent an animal model of type I diabetes 
mellitus (IDDM). 

GPI-PLD levels can first be compared in the serum of 
streptozotocin-treated and control rats over time. If 
levels are consistently lower following streptozotocin 
treatment this would imply that the islets of Langerhans 
are indeed the major source of GPI-PLD found in serum. 
The ability of streptozotocin-treated and control rats to 
clear glucose following administration of either insulin 
alone or insulin combined with varying doses of GPI-PLD 
can also be examined. The GPI-PLD will be supplied bound 
to apolipoprotein A1 (apo A1) as it is in normal serum. 
If the addition of GPI-PLD increases the ability of 
insulin to reduce blood glucose, then this will have 
important implications in the treatment of diabetic 
patients. 

Site of action 

The function of the enzyme in releasing GPI-anchored 
proteins, and its postulated function as the generator of 
IPG second messengers, require the enzyme to be active at 
the cell surface. It is known that GPI-anchored proteins 
accumulate in clusters in caveolae, an uncoated pit 
membrane specialisation, and so this is a good potential 
site for GPI-PLD activity. Analysis of the primary 
structure of the protein predicts a secondary structural 
arrangement of four amphipathic helices, thus suggesting 
that the protein can interact with lipids in membranes. 
Previous experiments have demonstrated significant 
amounts of the enzyme in the lysosomal fraction but not 
in the cytosol. The location of GPI-PLD will be examined 
by staining tissues with anti-GPI-PLD antibodies, 
followed by a gold particle-labelled second antibody. 
Tissue can then be prepared for transmission electron 
microscopy and the location of the GPI-PLD protein 
determined. Caveolae will also be produced according to 
the protocol of Chang et al (1994), which involves three 
rounds of sucrose step gradient ultracentrifugation. 
Caveolae-enriched proteins will then be separated by 
SDS-PAGE and electrophoretically transferred to 
nitrocellulose membranes. We can then use the 
anti-GPI-PLD antibody to determine if GPI-PLD is present 
in these membrane specialisations. 

Activation of GPI-PLD 

If GPI-PLD is found to be phosphorylated by protein 
kinase C and/or protein kinase CK2 by MALDI-TOF 
spectrometry, the interaction of these proteins can be 
confirmed using immunoprecipitation since antibodies to 
GPI-PLD, protein kinase C and protein kinase CK2 have all 
been produced. The yeast two hybrid system can also be 
used to identify other proteins which interact with 
GPI-PLD in the cell. The yeast two hybrid system (Chen et 
al, 1991) is based on the property of the yeast 
transcriptional activator Gal4, which is separable into 
DNA binding and transcriptional activating domains. 
GPI-PLD cDNAs can be cloned in frame into the DNA binding 
domain vector. This will be co-transfected into an 
appropriate yeast host strain along with a library of 
cDNAs cloned into the activation domain vector. 
Interaction of a protein with GPI-PLD will therefore 
result in co-localisation of the activation and DNA binding 
domains, and hence transcription of the β-galactosidase 
reporter gene. Clones containing interacting proteins 
are then identified by the colour reaction they produce. 
The advantage of this system is that the gene encoding 
the interacting protein is immediately available for 
sequence analysis and thus identification. The use of 
this system has enabled identification of many 
interacting proteins and the system is available in kit 
form from Clontech. This also provides a method of screening 
for substances which are capable of activating GPI-PLD, 
e.g. for further development as lead compounds. 



Discussion 

GPI-PLD is a metalloenzyme with 5 and 10 atoms per 
molecule of calcium and zinc, respectively. It 
circulates in a complex with apolipoprotein A1. GPI-PLD 
is produced in the pancreas by both α and β-cells in the 
islets of Langerhans. It is also produced by a mouse 
insulinoma cell line (βTC3), with GPI-PLD and insulin 
generally colocalised intracellularly. The enzyme was 
shown to be secreted in response to insulin 
secretagogues. Both isoenzymes of GPI-PLD also seem to 
be present in liver; a major part of the activity could 
be washed away from the tissue by extraction with 
detergent-free buffer (thus, likely to be the plasma 
enzyme). There is some suggestion that the liver, as 
well as the pancreas, may contribute to the serum pool of 
GPI-PLD, as patients with liver disease have lower levels 
of active enzyme, which is correlated with the reduced 
albumin levels. 

It has been shown that streptozotocin-induced diabetes 
mellitus in the rat reduced the basal content of 
insulin-sensitive PIG in isolated hepatocytes by about 60%. The 
authors conclude that insulin resistance in these rats is 
related to the impairment of PIG metabolism. It has 
also been shown that the mRNA for a GPI-PLD-like gene was 
overexpressed in genetically obese (ob/ob) mice in 
comparison to lean littermates. In the context of the 
invention described herein, this latter finding suggests 
that GPI-PLD levels are responsive to the obese/diabetic 
genotype. 

Figure 1: Alignment of GPI-PLD deduced amino acid sequences 

Top: protein produced from cDNA clone a1 
Mid: protein produced from Roche patent bovine liver sequence 
Bot: protein produced from Roche patent human liver sequence 



MSAFRLWPGLLIMLG-SLCHRGSPCGLSTHIEIGHRALEFLQLHNGRVNYRELLLEHQDA 
MSAFRFWSGLLMLLG-FLCPRSSPCGISTHIEIGHRALEFLHLQDGSINYKELLLRHQDA 
MSAFRLWPGLLMIVMASLCHRGSSCGLSTHIEIGHRALEFLHLHNGHVNYKELLLEHQDA 

YQAGIVFPDCFYPSICKGGKFHDVSESTHWTPFLNT^SVHYIRENYPLPWEKDTEKLVAFL 
YQAGSVFPDSFYPSICERGQFHDVSESTHWTPFLNASVHYIRKNYPLPWDEDTEKLVAFL 
YQAGTVFPDCFYPSLCKGGKFHDVSESTHWTPFLNASVHYIRENYPLPWEKDTEKLVAFL 

FGITSHMAADVSWHSLGLEQGFLRTMGAIDFHGSYSEAHSAGDFGGDVLSQFEFNFNYLA 
FGITSHMVADVNWHSLGIENGFLRTMAAIDFHNSYPEAHPAGDFGGDVLSQFEFKFNYLS 
FGITSHMVADVSWHSLGIEQGFLRTMGAIDFHGSYSEAHSAGDFGGDVLSQFEFNFNYLA 

RRWYVPVKDLLGIYEKLYGRKVITEhA^IVDCSHIQFLEMYGEMLAVSKLYPTYSTKSPFL 
RHWYVPAEDLLGIYRELYGRIVITKKAIVDCSYLQFLEMYAEMLAISKLYPTYSVKSPFL 
RRWYVPVKDLLGIYEKLYGREVITENVIVBCSHIQFLEMYGEMLAVSKLYPSYSTKSPFL 

VEQFQEYFLGGLDDMAFWSTNIYHLTSFMLENGTSDCNLPENPENPLFIACGGQQNHTQG 

VEQFQEYFLGGLEDMAFWSTNIYHLTSTMLKNGTSNCNLPENP LFITCGGQQNNTHG 

VEQFQEYFLGGLDDMAFWSTNIYHLTSFMLENGTSDCSLFENPENPLFIACGGQQNHTQG 

SKMQKNDFHRNLTTSLTESVDRNINYTERGVFFSVNSWTPDSMSFIYKfliERNIRTMFIG 
SKVQKNGFHKNVTAALTKNIGKHINYTKRGVFFSVDSWTMDFLSFMYKSLERSIREMFIG 

SKMQKNDFHRNLTSSLTENIDRNINYTERGVFFSVNSWTPDSMSFIYKALERNVRTMFIG 

GSQLSQKHVSSPLASYFLSFPYARLGWAMTSADLNQDGHGDLWGAPGYSRPGHIHIGRV 
SSQP-LTHVSSPAASYYLSFPYTRLGWAMTSADLNQDGYGDLWGAPGYSHPGRIHVGRV 
GSQLSQKHISSPIASYFLSFPYARLGWAMTSADLNQDGYGDLWGAPGYSRPGRIHIGRV 

YLIYGNDLGLPPVDLDLDKEAHRILEGFQPSGRFGSALAVLDFNVDGVPDLAVGAPSVGS 
YLIYGNDLG-PRIDLDLDKEAHGILEGFQPSGRFGSAVAVLDFNVDGVPDLAVGAPSVGS 
YLIYGNELGLPPVDLDLDKEAHGILEGFQPSGRFGSALAMLDFNMDGVPDLAVGAPSVGS 

EQLTYKGAVYVYFGSKQGGMSSSPNITISCQDIYCNLGWTLLAADVNGDSEPD-LVIGSP 
EKLTYTGAVYVYFGSKQGQLSSSPNVTISCQDTYCNLGVTTLLAADVDGDSEPDLFVIGSP 
EQLTYKGAVYVYFGSKQGRMSSSPNITISCQDIYCNLGWTLLAADVNGDSEPD-LVIGSP 

FAPGGGKQKGIV7VAFYSGPSLSDKEKLNVEAANWTVRGEEDFSWFGYSLHGVTVDNRTLL 
FAFGGGKQKGIVAAFYSGSSYSSREKIJ^EAANWMVKGEEDFAWLGYSLHGVimmRTLL 
FAPGGGKQKGIVAAFYSGPSLSNKEKLNVEAANWTVRGEEDFAWFGYSLHGVTVDNRTLL 

LVGSPTWKNASRLGHLLHIRDEKKSLGRVYGYFPPNGQSWFTISGDKAMGKLGTSLSSGH 
LAGSPTWKDTSSQGHLFRTRDEKQSPGRVYGYFPPICQSWFTISGDKAMGKLGTSLSSGH 
LVGSPTWKNASRLGRLLHIRDEKKSLGRVYGYFPPNSQSWFTIVGDKAMGKLGTSLSSGH 

VLMNGTLKQVLLVGAPTYDDVSKVAFLTVTLHQGGATRMYALISDAQPLLLSTFSGDRRF 
VIVNGTRTQVLLVGAPTQDWSKS-FLTMTLHQGGSTRMYELTPDSQPSLLSTFSGNRRF 
VLMNGTLTQVLLVGAPTRDDVSKMAFLTMTLHQGGATRMYALTSDLQPPLLSTFSGDRRF 

SRFGGVLHLSDLDDDGLDEIIMAAPLRIADVTSGLIGGEDGRVYVYNGKETTLGDMTGKC 
SRFGGVLHLSDLDNDGLDEIIVAAPLRITDATAGLMGEEDGRVYVFNGKQITVGDVTGKC 
SRFGGVLHLSDLDDDGVDEIIVAAPLRIADVTSGLIGGEDGRVYVYNGKETTLGDMTGKC 

KSWITPCPEEKAQYVLISPEASSRFGSSLITVRSKAKNQWIAAGRSSLGARLSGALHVY 
KSWVTPCPEEKAQYVLISPEAGSRFGSSVITVRSKEKNQVIIAAGRSSLGARLSGVLHIY 
KSWMTPCPEEKAQYVLISPEASSRFGSSLITVRSKAKNQWIAAGRSSLGARLSGALHVY 

Figure 2: Alignment of human GPI-PLD nucleic acid sequences 

Top: pancreatic-form cDNA sequence from GenBank database 
Mid: our sequence cloned from human liver cDNA library 
Bot: Roche patent pancreatic-form partial cDNA sequence 



1 GTGACCTGCTTAGAGAGAAGCGGTGGGTCTGCACCTGGATTTTGGAGTCCCAGTGCTGCT 60

1                                                     ATGTCTGCT 9
61 GCAGCTCTGAGCATTCCCACGTCACCAGAGAAGCCGGTGGGCAATGAGAGCATGTCTGCT 120

10 TTCAGGTTGTGGCCTGGCCTGCTGATCATGTTGGGTTCTCTCTGCCATAGAGGTTCACCG 69
121 TTCAGGTTGTGGCCTGGCCTGCTGATCATGTTGGGTTCTCTCTGCCATAGAGGTTCACCG 180

70 TGTGGCCTTTCAACACACGTAGAAATAGGACAGAGAGCTCTGGAGTTTCTTCAGCTTCAC 129
181 TGTGGCCTTTGAACAGACATAGAAATAGGAGACAGAGCTCTGGAGTTTCTTGAGCTTGAC 240

130 AATGGGCGTGTTAACTAGAGAGAGCTGTTACTAGAACACCAGGATGCGTATCAGGCTGGA 189
241 AATGGGCGTGTTAACTAGAGAGAGCTGTTACTAGAACACCAGGATGCGTATCAGGCTGGA 300

190 ATCGTGTTTCCTGATTGTTTTTACCCTAGCATCTGCAAAGGAGGAAAATTCCATGATGTG 249
301 ATCGTGTTTCCTGATTGTTTTTACCCTAGCATCTGCAAAGGAGGAAAATTCCATGATGTG 360

250 TCTGAGAGCACTCACTGGACTCCGTTTCTTAATGCAAGCGTTCATTATATCCGAGAGAAC 309
361 TCTGAGAGCACTCACTGGACTCCGTTTCTTAATGCAAGCGTTCATTATATCCGAGAGAAC 420

310 TATCCCCTTCCCTGGGAGAAGGACACAGAGAAACTGGTAGCTTTCTTGTTTGGAATTACT 369
421 TATCCCCTTCCCTGGGAGAAGGACACAGAGAAACTGGTAGCTTTCTTGTTTGGAATTACT 480

370 TCTCACATGGCGGCAGATGTCAGCTGGCATAGTCTGGGCCTTGAACAAGGATTCCTTAGG 429
481 TCTCACATGGCGGCAGATGTCAGCTGGCATAGTCTGGGCCTTGAACAAGGATTCCTTAGG 540

430 ACCATGGGAGCTATTGATTTTCACGGCTCCTATTCAGAGGCTCATTCGGCTGGTGATTTT 489
541 ACCATGGGAGCTATTGATTTTCACGGCTCCTATTCAGAGGCTGATTCGGCTGGTGATTTT 600

490 GGAGGAGATGTGTTGAGCCAGTTTGAATTTAATTTTAATTACCTTGCACGACGCTGGTAT 549
601 GGAGGAGATGTGTTGAGCCAGTTTGAATTTAATTTTAATTACCTTGCACGACGCTGGTAT 660

550 GTGCCAGTCAAAGATCTACTGGGAATTTATGAGAAACTGTATGGTCGAAAAGTCATCACC 609
661 GTGCCAGTCAAAGATCTACTGGGAATTTATGAGAAACTGTATGGTCGAAAAGTCATCACC 720

610 GAAAATGTAATCGTTGATTGTTCACATATCCAGTTCTTAGAAATGTATGGTGAGATGCTA 669
721 GAAAATGTAATCGTTGATTGTTCACATATCCAGTTCTTAGAAATGTATGGTGAGATGCTA 780

670 GCTGTTTCCAAGTTATATCCCACTTACTCTACAAAGTCCCCGTTTTTGGTGGAACAATTC 729
781 GCTGTTTCCAAGTTATATCCCACTTACTCTACAAAGTCCCCGTTTTTGGTGGAACAATTC 840

730 CAAGAGTATTTTCTTGGAGGACTGGATGATATGGCATTTTGGTCCACTAATATTTACCAT 789
841 CAAGAGTATTTTCTTGGAGGACTGGATGATATGGCATTTTGGTCCACTAATATTTACCAT 900



790 CTAACAATCTTCATGTTGGAGAATGGGACCAGTGACTGCAACCTGCCTGAGAACCCTCTG 849 
901 CTAACAAGCTTCATGTTGGAGAATGGGACCAGTGACTGCAACCTGCCTGAGAACCCTCTG 960 



850 TTCATTGCATGTGGCGGCCAGCAAAACCACACCCAGGGCTCAAAAATGCAGAAAAATGAT 909
961 TTCATTGCATGTGGCGGCCAGCAAAACCACACCCAGGGCTCAAAAATGCAGAAAAATGAT 1020

910 TTTCACAGAAATTTGACTACATCCCTAACTGAAAGTGTTGACAGGAATATAAACTATACT 969
1021 TTTCACAGAAATTTGACTACATCCCTAACTGAAAGTGTTGACAGGAATATAAACTATACT 1080



970 GAAAGAGGAGTGTTCTTTAGTGTAAATTCCTGGACCCCGGATTCCATGTCCTTTATCTAC 1029 
1081 GAAAGAGGAGTGTTCTTTAGTGTAAATTCCTGGACCCCGGATTCCATGTCCTTTATCTAC 1140 



1030 AAGGCTTTGGAAAGGAACATAAGGACAATGTTCATAGGTGGCTCTCAGTTGTCACAAAAG 1089
1141 AAGGCTTTGGAAAGGAACATAAGGACAATGTTCATAGGTGGCTCTCAGTTGTCACAAAAG 1200



1090 CACGTCTCCAGCCCCTTAGCATCTTACTTCTTGTCATTTCCTTATGCGAGGCTTGGCTGG 1149 
1201 CACGTCTCCAGCCCCTTAGCATCTTACTTCTTGTCATTTCCTTATGCGAGGCTTGGCTGG 1260 



1150 GCAAT GAC CT CAGCT GACCT CAACCAGGAT GGGCAC GGT GACCT CGT GGTGGGC GCACCA 1209 
1261 GCAAT GACCT CAGCT GAC CT CAAC CAGGAT GGGCAC GGT GACCT CGT GGT GGGC GCACCA 1320 



1210 GGCTACAGCCGC C C C GGCCACAT C CACAT C GGGCGCGT GT AC CT CAT CT ACGGCAAT GAC 1269 
1321 GGCTACAGCCGC C CC GGC CACAT C CACAT C GGGC GC GT GT AC CT CAT CTAC GGCAATGAC 1380 



1270 CTGGGCCTGCCACCTGTTGACCTGGACCTGGACAAGGAGGCCCACAGGATCCTTGAAGGC 1329 
1381 CTGGGCCTGCCACCTGTTGACCTGGACCTGGACAAGGAGGCCCACAGGATCCTTGAAGGC 1440 



1330 TTCCAGCCCTCAGGTCGGTTTGGCTCGGCCTTGGCTGTGTTGGACTTTAACGTGGACGGC 138 9 
1441 TTCCAGCCCTCAGGTCGGTTT GGCT CGGCCTTGGCTGT GTT GGACTTTAACGTGGAC GGC 1500 



1390 GTGCCTGACCTGGCCGTGGGAGCTCCCTCGGTGGGCTCCGAGCAGCTCACCTACAAAGGT 1449 
1501 GTGCCTGACCTGGCCGTGGGAGCTCCCTCGGTGGGCTCCGAGCAGCTCACCTACAAAGGT 1560 



1450 GCCGTGTATGTCTACTTTGGTTCCAAACAAGGAGGAATGTCTTCTTCCCCTAACATCACC 1509 
1561 GCCGT GTAT GT CT ACTTT GGTT C CAAACAAGGAGGAAT GT CTT CTT CCC CTAACAT CAC C 1620 



1510 ATTT CTT GCCAGGACAT CTACT GTAACTTGGGCT GGACT CT CTT GGCT GCAGAT GT GAAT 1569 
1621 ATTT CTTGC CAGGACAT CTACT GTAACTT GGGCT GGACT CT CTT GGCTGCAGATGT GAAT 1680 



1570 GGAGACAGT GAACC C GAT CT GGT CATC GGCT C C C CT T TT GCAC CAGGT GGAGGGAAGCAG 1629 
1681 GGAGACAGTGAACCCGATCTGGTCATCGGCTCCCCTTTTGCACCAGGTGGAGGGAAGCAG 1740 



1630 AAGGGAATT GTGGCT GCGTTTTATT CT GGC C CCAGC CT GAGC GACAAAGAAAAACT GAAC 1689 
1741 AAGGGAATT GT GGCT GCGTTTTATTCT GGC CCCAGCCT GAGC GACAAAGAAAAACTGAAC 1800 
1 CTGGCCCCAGCCTGAGC GACAAAGAAAAACTGAAC 35 






169 0 GTGGAGGCAGCCAACTGGAC GGT GAGAGGC GAGGAAGACTT CT C CT GGTTT GGATATT CC 17 49 
18 01 GT GGAGGCAGC CAACT GGACGGT GAGAGGCGAGGAAGACTT CT CCT GGTTTGGATATT CC 18 60 
36 GTGGAGGCAGCCAACTGGACGGTGAGAGGCGAGGAAGACTTCTCCTGGTTTGGATATTCC 95 

1750 CTT CAC GGT GT CACT GT GGACAACAGAAC CTT GCT GTTGGTT GGGAGCCCGACCT GGAAG 1809 
1861 CTT CACGGTGT CACT GT GGACAACAGAAC CTTGCTGTTGGTTGGGAGCCCGACCTGGAAG 1920 
9 6 CTTCACGGTGTCACTGTGGACAAGAGAACCTTGCTGTTGGTTGGGAGCCCGACCTGGAAG 155 

1810 AAT GC CAGCAGGCT GGGC CATTTGTTACACAT C C GAGAT GAGAAAAAGAGCCTT GGGAGG 18 69 
1921 AAT GC CAGCAGGCT GGGC CATTT GTTACACAT CC GAGAT GAGAAAAAGAGCCTT GGGAGG 198 0 
156 AAT GC CAGCAGGCT GGGC CATTT GTTACACAT CC GAGAT GAGAAAAAGAGCCTT GGGAGG 215 

1870 GTGTAT GGCTACTT C CCAC CAAAC GGCCAAAGCT GGTTT AC CATTT CTGGAGACAAGGCA 1929 
1981 GT GT AT GGCTACTT C CCAC CAAAC GGCCAAAGCT GGTTTAC CATTT CT GGAGACAAGGCA 2040 
216 GT GTAT GGCTACTT CC - ACCAAACGGCCAAAGCT GGTT T AC CATTTCT GGAGACAAGGCA 275 

1930 AT GGGGAAACT GGGTACTTC CCTTT CCAGT GGC CAC GT ACT GAT GAAT GGGACT CTGAAA 1989 
2041 AT GGGGAAACT GGGTACTTC C CTTT CCAGT GGC CAC GT ACT GAT GAAT GGGACT CT GAAA 2100 
27 6 AT GGGGAAACT GGGTACTT C C CTTTCCAGT GGT CACGTACTGAT GAAT GGGACT CT GAAA 335 

1990 CAAGT GCT GCT GGTT GGAGC C C CTAC GTAC GAT GAC GT GTCTAAGGTGGCATTCCT GACC 2049 
2101 CAAGT GCT GCT GGTT GGAGCC C CTAC GTAC GAT GACGT GT CTAAGGT GGCATT C CTGACC 2160 
336 CAAGT GCT GCT GGTT GGAGCC C CTAC GTAC GAT GACGT GT CTAAGGT GGCATT C CT GACC 395 

2 050 GT GAC CCT ACACCAAGGC GGAGC CACT CGCAT GTACGCACT CAT AT CT GACGCGCAGCCT 2109 
2161 GT GAC C CTACACCAAGGC GGAGC CACT CGCAT GTAC GCACT CAT AT CT GAC GC GCAGCCT 2220 

3 96 GT GAC C CTACACCAAGGC GGAGC CACT C GC GT GTAC GCACT CAT AT CT GACGCGCAGCCT 455 

2110 CTGCTGCTCAGCACCTTCAGCGGAGACCGCCGCTTCTCCCGATTTGGTGGCGTTCTGCAC 2169 
2221 CTGCTGCTCAGCACCTTCAGCGGAGACCGCCGCTTCTCCCGATTTGGTGGCGTTCTGCAC 2280 
456 CTGCTGCTCAGCACCTTCAGCGGAGACCGCCGCTTCTCCCGATTTGGTGGCGTTCTGCAC 515 

2170 T T GAGT GAC CT GGAT GAT GAT GGCTTAGAT GAAAT CAT CAT GGCAGCCC C CCT GAGGATA 2229 
22 81 TTGAGT GAC CT GGAT GAT GAT GGCTTAGAT GAAAT CAT CAT GGCAGCCCCCCT GAGGATA 2340 
516 TT GAGT GAC CT GGAT GAT GAT GGCTTAGAT GAAAT CAT CAT GGCAGCC C C CCT GAGGATA 575 

2230 GCAGAT GTAAC CTCT GGACT GATT GGGGGAGAAGAC GGCC GAGTATAT GTAT AT AAT GGC 2289 
2341 GCAGAT GTAAC CT CT GGACT GATT GGGGGAGAAGAC GGC CGAGT ATAT GTAT AT AAT GGC 2400 
57 6 GCAGAT GTAAC CT CT GGACT GATT GGGGGAGAAGAC GGCC GAGT ATAT GTAT AT AAT GGC 635 

2290 AAAGAGAC CACCCTT GGT GACAT GACT GGCAAAT GCAAAT CAT GGATAACT C CAT GT CCA 2349 
2401 AAAGAGAC CACCCTT GGT GACAT GACT GGCAAAT GCAAAT CAT GGAT AACT CCAT GT CCA 2460 
636 AAAGAGAC CACCCTT GGT GACAT GACT GGCAAATGCAAAT CAT GGATAACT C CAT GT CCA 695 

2350 GAAGAAAAGGC C CAAT AT GTATT GATTT CT CCT GAAGC CAGCT CAAGGTTT GGGAGCTCC 2409 
2 461 GAAGAAAAGGCC CAATAT GTATT GATTT CTC CT GAAGC CAGCT CAAGGTTT GGGAGCT CC 2520 
696 GAAGAAAAGGCGCAATAT GTATT GATTT CT CCT GAAGC CAGCT CAAGGTTT GGGAGCT CC 755 

2 410 CT CAT CACC GT GAGGT C CAAGGCAAAGAAC CAAGT C GT CATT GCT GCTGGAAGGAGTT CT 2469 
2521 CT CAT CACC GT GAGGT C CAAGGCAAAGAAC CAAGT C GT CATT GCT GCT GGAAGGAGTT CT 2580 
7 56 CT CAT CACC GT GAGGT C CAAGGCAAAGAAC CAAGT C GT CATT GCT GCT GGAAGGAGTT CT 815 

2 470 TT GGGAGC C C GACT CT C C GGGGCACTT CAC GT CTAT AGC CTT GGCT CAGATT GAAGATTT 2529 
2581 T T GGGAGCC C GACT CT C CGGGGCACTT CAC GT CTAT AGCCTT GGCT CAGATT GAAGATTT 2 640 
816 TTGGGAGCCCGACTCTCCGGGGCACTTCACGTCTATAGCCTTGGCTCAGATTGAAGATTT 875 

253 0 CACT GCATTT C CCCACT CT GC C CAC CT CT CT CAT GCT GAAT C ACAT C CAT GGTGAGCATT 258 9 
2641 CACT GCATTT C CCCACT CT GC C CACCT CT CT CAT GCT GAAT CACAT CCAT GGT GAGCATT 27 00 
876 CACT GCATT T C C C CACT CT GC C CACCT CT CTCAT GCT GAAT CACAT CCAT GGT GAGCATT 935 



2590 T T GAT GGACAAAGT GGCACAT C CAGT GGAGC GGT GGT AGAT CCT GAT AGACAT GGGGCT C 2649 
2701 T T GAT GGACAAAGT GGCACAT C CAGT GGAGC GGT GGT AGAT CCT GAT AGACAT GGGGCT C 27 60 
936 T T GAT GGACAAAGT GGCACAT C CAGT GGAGC GGT GGT AGAT CCT GAT AGACAT GGGGCT C 995 




2650 CTGGGA 2655 

2761 CTGGGA 2766 

99 6 CTGGGACAGTGAACCCGATCTGGTCATCGGCTCCCCTTTTGCACC^GGTGGAGGGAAGCA 1055 

2 656 GT AGAGAGACACACTAACAGC CACAC C C T C T G 2687 

27 67 GT AGAGAGACACACTAACAGC CACAC C CT CT G 2798 

1056 GAAGGGAAT T GT GGC T GC GT T T T ATT GAGT AGAGAGACACACTAACAGC CACAC C CT CT G 1115 

2 688 GAAAT CT GATACAGTAAATATAT GACTGCAC CAGAAATAT GT GAAATAGCAGACATTCTG 2747 

27 99 GAAAT C T GATACAGTAAATATAT GACTGCAC CAG 2833 

1116 GAAAT CT GATACAGTAAATATAT GACTACAC CAGAAATAT GT GAAAT AGCAGACATT CTG 1175 

2748 CTTACTCATGTCTCCTTCCACAGTTTACTTCCTCGCTCCCTTTGCATCTAAACCTTTCTT 2807 

1176 CTTACTCATGTCTCCTTCCACAGTTTACTTCCTCGCTCCCTTTGCATCTAAACCTTTCTT 1235 

28 08 CTTTCCCAACTTATTGCCTGTAGTCAGACCTGCTGTACAACCTATTTCCTCTTCCTCTTG 28 67 

1236 CTTTCCCAACTTATTGCCTGTAGTC 12 61 

28 68 AATGTCTTTCCAGTGGCTGGAAAGGTCCCTCTGTGGTTATCTGTTAGAACAGTCTCTGTA 2927 



2 928 CACAATT CCT C CTAAAAACAT C CTTTTTTAAAAAAAGAATT GTT CAGC CATAAAGAAAGA 2987 



2 9 88 ACAAGAT CAT GC C CTTT GCAGGGACAT GGAT GGAGCT GGAGGCCATTAT C CT T CATAAAC 3047 



3048 TATT GCAGGAACAGAAAAC CAAACACTC CATATT CT CACTT GTAAGT GGGAGCTAAGT GA 3107 



3108 GAACACGTGGACACAT AGAGGGAAACAACACACACT GGGGC CT AT GAGAGGGC GGAAGGT 3167 



3168 GGGAGGAGGGAGAGAT CAGGAAAAATAACTAAT GGAT ACTTAGGGT GAT GAAAT AAT CT G 3227 



3228 T GTAACAAAC CC CCAT GACACACCTTTAT GTAT GTAACAAAC CAGCACTT CCT GC GCAT G 3287 



3288 TACC CCT GAACTTAAAAGTTAAAAAAAAGTT GAACTTAAAAAT AACAGATT GGC C CAT GC 3347 



3348 CAATCAAAGTATAATAGAAAGCATAGTATAC 3378



Figure 3: Amino acid sequences of GPI-PLD a1, b2 and d3.
cDNA clone d3 

MILLFQDSMSFIYKALERNIRTMFIGGSQLSQKHVSSPLASYFLSFPYARLGWAMTSADL 
NQDGHGDLVVGAPGYSRPGHIHIGRVYLIYGNDLGLPPVDLDLDKEAHRILEGFQPSGRF 
GSALAVLDFNVDGVPDLAVGAPSVGSEQLTKGAVYVYFGSKQGGMSSSPNITISCQDIYC 
NLGWTLLAADVNGDSEPDLVIGSPFAPGGGKQKGIVAAFYSGPSLSDKEKLNVEAANWTV 
RGEEDFSWFGYSLHGVTVDNRTLLLVGSPTWKNASRLGHLLHIRDEKKSLGRVYGYFPPN 

GQSWFTISGDKAMGKLGTSLSSGHVLMNGTLKQVLLVGAPTYDDVSKVAFLTVTLHQGGA
TRMYALISDAQPLLLSTFSGDRRFSRFGGVLHLSDLDDDGLDEIIMAAPLRIADVTSGLI 
GGEDGRVYVYNGKETTLGDMTGKCKSWITPCPEEKAQYVLISPEASSRFGSSLITVRSKA 

KNQVVIAAGRSSLGARLSGALHVYSLGSD



cDNA clone b2 

MSAFRLWPGLLIMLGSLCHRGSPCGLSTHIEIGHRALEFLQLHNGRVNYRELLLEHQDAY
QAGIVFPDCFYPSICKGGKFHDVSESTHWTPFLNASVHYIRENYPLPWEKDTEKLVAFLF
GITSHMAADVSWHSLGLEQGFLRTMGAIDFHGSYSEAHSAGDFGGDVLSQFEFNFNYLAR
RWYVPVKDLLGIYEKLYGRKVITENVIVDCSHIQFLEMYGEMLAVSKLYPTYSTKSPFLV
EQFQEYFLGGLDDMAFWSTNIYHLTSFMLENGTSDCNLPENPLFIACGGQQNHTQGSKMQ
KNDFHRNLTTSLTESVDRNINYTERGVFFSVNSWTPDSMSFIYKALERNIRTMFIGGSQL
SQKHVSSPLASYFLSFPYARLGWAMTSADLNQDGHGDLVVGAPGYSRPGHIHIGRVYLIY
GNDLGLPPVDLDLDKEAHRILEGFQPSGRFGSALAVLDFNVDGVPDLAVGAPSVGSEQLT
YKGAVYVYFGSKQGGMSSSPNITISCQDIYCNLGWTLLAADVNGDSEPDLVIGSPFAPGG
GKQKGIVAAFYSGPSLSDKEKLNVEAANWTVRGEEDFSWFGYSLHGVTVDNRTLLLVGSP
TWKNASRLGHLLHIRDEKKSLGRVYGYFPPNGQSWFTISGDKAMGKLGTSLSSGHVLMNG
TLKQVLLVGAPTYDDVSKVAFLTVTLHQGGATRMYALISDAQPLLLSTFSGDRRFSRFGG
VLHLSDLDDDGLDEIIMAAPLRIADVTSGLIGGEDGRVYVYNGKETTLGDMTGKCKSWIT
PCPEEKVSEKKKKKK



cDNA clone a1

MSAFRLWPGLLIMLGSLCHRGSPCGLSTHIEIGHRALEFLQLHNGRVNYRELLLEHQDAY
QAGIVFPDCFYPSICKGGKFHDVSESTHWTPFLNASVHYIRENYPLPWEKDTEKLVAFLF
GITSHMAADVSWHSLGLEQGFLRTMGAIDFHGSYSEAHSAGDFGGDVLSQFEFNFNYLAR
RWYVPVKDLLGIYEKLYGRKVITENVIVDCSHIQFLEMYGEMLAVSKLYPTYSTKSPFLV
EQFQEYFLGGLDDMAFWSTNIYHLTSFMLENGTSDCNLPENPLFIACGGQQNHTQGSKMQ
KNDFHRNLTTSLTESVDRNINYTERGVFFSVNSWTPDSMSFIYKALERNIRTMFIGGSQL
SQKHVSSPLASYFLSFPYARLGWAMTSADLNQDGHGDLVVGAPGYSRPGHIHIGRVYLIY
GNDLGLPPVDLDLDKEAHRILEGFQPSGRFGSALAVLDFNVDGVPDLAVGAPSVGSEQLT
YKGAVYVYFGSKQGGMSSSPNITISCQDIYCNLGWTLLAADVNGDSEPDLVIGSPFAPGG
GKQKGIVAAFYSGPSLSDKEKLNVEAANWTVRGEEDFSWFGYSLHGVTVDNRTLLLVGSP
TWKNASRLGHLLHIRDEKKSLGRVYGYFPPNGQSWFTISGDKAMGKLGTSLSSGHVLMNG
TLKQVLLVGAPTYDDVSKVAFLTVTLHQGGATRMYALISDAQPLLLSTFSGDRRFSRFGG
VLHLSDLDDDGLDEIIMAAPLRIADVTSGLIGGEDGRVYVYNGKETTLGDMTGKCKSWIT
PCPEEKAQYVLISPEASSRFGSSLITVRSKAKNQVVIAAGRSSLGARLSGALHVYSLGSD



Figure 4: Human GPI-PLD cDNA clone a1
2832 bp: 690 a 688 c 735 g 719 t 

1 gtgacctgct tagagagaag cggtgggtct gcacctggat tttggagtcc cagtgctgct
61 gcagctctga gcattcccac gtcaccagag aagccggtgg gcaatgagag catgtctgct
121 ttcaggttgt ggcctggcct gctgatcatg ttgggttctc tctgccatag aggttcaccg
181 tgtggccttt caacacacat agaaatagga cacagagctc tggagtttct tcagcttcac
241 aatgggcgtg ttaactacag agagctgtta ctagaacacc aggatgcgta tcaggctgga
301 atcgtgtttc ctgattgttt ttaccctagc atctgcaaag gaggaaaatt ccatgatgtg
361 tctgagagca ctcactggac tccgtttctt aatgcaagcg ttcattatat ccgagagaac
421 tatccccttc cctgggagaa ggacacagag aaactggtag ctttcttgtt tggaattact
481 tctcacatgg cggcagatgt cagctggcat agtctgggcc ttgaacaagg attccttagg
541 accatgggag ctattgattt tcacggctcc tattcagagg ctcattcggc tggtgatttt
601 ggaggagatg tgttgagcca gtttgaattt aattttaatt accttgcacg acgctggtat
661 gtgccagtca aagatctact gggaatttat gagaaactgt atggtcgaaa agtcatcacc
721 gaaaatgtaa tcgttgattg ttcacatatc cagttcttag aaatgtatgg tgagatgcta
781 gctgtttcca agttatatcc cacttactct acaaagtccc cgtttttggt ggaacaattc
841 caagagtatt ttcttggagg actggatgat atggcatttt ggtccactaa tatttaccat
901 ctaacaagct tcatgttgga gaatgggacc agtgactgca acctgcctga gaaccctctg
961 ttcattgcat gtggcggcca gcaaaaccac acccagggct caaaaatgca gaaaaatgat
1021 tttcacagaa atttgactac atccctaact gaaagtgttg acaggaatat aaactatact
1081 gaaagaggag tgttctttag tgtaaattcc tggaccccgg attccatgtc ctttatctac
1141 aaggctttgg aaaggaacat aaggacaatg ttcataggtg gctctcagtt gtcacaaaag
1201 cacgtctcca gccccttagc atcttacttc ttgtcatttc cttatgcgag gcttggctgg
1261 gcaatgacct cagctgacct caaccaggat gggcacggtg acctcgtggt gggcgcacca
1321 ggctacagcc gccccggcca catccacatc gggcgcgtgt acctcatcta cggcaatgac
1381 ctgggcctgc cacctgttga cctggacctg gacaaggagg cccacaggat ccttgaaggc
1441 ttccagccct caggtcggtt tggctcggcc ttggctgtgt tggactttaa cgtggacggc
1501 gtgcctgacc tggccgtggg agctccctcg gtgggctccg agcagctcac ctacaaaggt
1561 gccgtgtatg tctactttgg ttccaaacaa ggaggaatgt cttcttcccc taacatcacc
1621 atttcttgcc aggacatcta ctgtaacttg ggctggactc tcttggctgc agatgtgaat
1681 ggagacagtg aacccgatct ggtcatcggc tccccttttg caccaggtgg agggaagcag
1741 aagggaattg tggctgcgtt ttattctggc cccagcctga gcgacaaaga aaaactgaac
1801 gtggaggcag ccaactggac ggtgagaggc gaggaagact tctcctggtt tggatattcc
1861 cttcacggtg tcactgtgga caacagaacc ttgctgttgg ttgggagccc gacctggaag
1921 aatgccagca ggctgggcca tttgttacac atccgagatg agaaaaagag ccttgggagg
1981 gtgtatggct acttcccacc aaacggccaa agctggttta ccatttctgg agacaaggca
2041 atggggaaac tgggtacttc cctttccagt ggccacgtac tgatgaatgg gactctgaaa
2101 caagtgctgc tggttggagc ccctacgtac gatgacgtgt ctaaggtggc attcctgacc
2161 gtgaccctac accaaggcgg agccactcgc atgtacgcac tcatatctga cgcgcagcct
2221 ctgctgctca gcaccttcag cggagaccgc cgcttctccc gatttggtgg cgttctgcac
2281 ttgagtgacc tggatgatga tggcttagat gaaatcatca tggcagcccc cctgaggata
2341 gcagatgtaa cctctggact gattggggga gaagacggcc gagtatatgt atataatggc
2401 aaagagacca cccttggtga catgactggc aaatgcaaat catggataac tccatgtcca
2461 gaagaaaagg cccaatatgt attgatttct cctgaagcca gctcaaggtt tgggagctcc
2521 ctcatcaccg tgaggtccaa ggcaaagaac caagtcgtca ttgctgctgg aaggagttct
2581 ttgggagccc gactctccgg ggcacttcac gtctatagcc ttggctcaga ttgaagattt
2641 cactgcattt ccccactctg cccacctctc tcatgctgaa tcacatccat ggtgagcatt
2701 ttgatggaca aagtggcaca tccagtggag cggtggtaga tcctgataga catggggctc
2761 ctgggagtag agagacacac taacagccac accctctgga aatctgatac agtaaatata
2821 tgactgcacc ag



Figure 5: Human GPI-PLD cDNA clone b2 
2472 bp: 617 a 588 c 639 g 628 t 



1 gtctgcacct ggattttgga gtcccagtgc tgctgcagct ctgagcattc ccacgtcacc
61 agagaagccg gtgggcaatg agagcatgtc tgctttcagg ttgtggcctg gcctgctgat
121 catgttgggt tctctctgcc atagaggttc accgtgtggc ctttcaacac acatagaaat
181 aggacacaga gctctggagt ttcttcagct tcacaatggg cgtgttaact acagagagct
241 gttactagaa caccaggatg cgtatcaggc tggaatcgtg tttcctgatt gtttttaccc
301 tagcatctgc aaaggaggaa aattccatga tgtgtctgag agcactcact ggactccgtt
361 tcttaatgca agcgttcatt atatccgaga gaactatccc cttccctggg agaaggacac
421 agagaaactg gtagctttct tgtttggaat tacttctcac atggcggcag atgtcagctg
481 gcatagtctg ggccttgaac aaggattcct taggaccatg ggagctattg attttcacgg
541 ctcctattca gaggctcatt cggctggtga ttttggagga gatgtgttga gccagtttga
601 atttaatttt aattaccttg cacgacgctg gtatgtgcca gtcaaagatc tactgggaat
661 ttatgagaaa ctgtatggtc gaaaagtcat caccgaaaat gtaatcgttg attgttcaca
721 tatccagttc ttagaaatgt atggtgagat gctagctgtt tccaagttat atcccactta
781 ctctacaaag tccccgtttt tggtggaaca attccaagag tattttcttg gaggactgga
841 tgatatggca ttttggtcca ctaatattta ccatctaaca agcttcatgt tggagaatgg
901 gaccagtgac tgcaacctgc ctgagaaccc tctgttcatt gcatgtggcg gccagcaaaa
961 ccacacccag ggctcaaaaa tgcagaaaaa tgattttcac agaaatttga ctacatccct
1021 aactgaaagt gttgacagga atataaacta tactgaaaga ggagtgttct ttagtgtaaa
1081 ttcctggacc ccggattcca tgtcctttat ctacaaggct ttggaaagga acataaggac
1141 aatgttcata ggtggctctc agttgtcaca aaagcacgtc tccagcccct tagcatctta
1201 cttcttgtca tttccttatg cgaggcttgg ctgggcaatg acctcagctg acctcaacca
1261 ggatgggcac ggtgacctcg tggtgggcgc accaggctac agccgccccg gccacatcca
1321 catcgggcgc gtgtacctca tctacggcaa tgacctgggc ctgccacctg ttgacctgga
1381 cctggacaag gaggcccaca ggatccttga aggcttccag ccctcaggtc ggtttggctc
1441 ggccttggct gtgttggact ttaacgtgga cggcgtgcct gacctggccg tgggagctcc
1501 ctcggtgggc tccgagcagc tcacctacaa aggtgccgtg tatgtctact ttggttccaa
1561 acaaggagga atgtcttctt cccctaacat caccatttct tgccaggaca tctactgtaa
1621 cttgggctgg actctcttgg ctgcagatgt gaatggagac agtgaacccg atctggtcat
1681 cggctcccct tttgcaccag gtggagggaa gcagaaggga attgtggctg cgttttattc
1741 tggccccagc ctgagcgaca aagaaaaact gaacgtggag gcagccaact ggacggtgag
1801 aggcgaggaa gacttctcct ggtttggata ttcccttcac ggtgtcactg tggacaacag
1861 aaccttgctg ttggttggga gcccgacctg gaagaatgcc agcaggctgg gccatttgtt
1921 acacatccga gatgagaaaa agagccttgg gagggtgtat ggctacttcc caccaaacgg
1981 ccaaagctgg tttaccattt ctggagacaa ggcaatgggg aaactgggta cttccctttc
2041 cagtggccac gtactgatga atgggactct gaaacaagtg ctgctggttg gagcccctac
2101 gtacgatgac gtgtctaagg tggcattcct gaccgtgacc ctacaccaag gcggagccac
2161 tcgcatgtac gcactcatat ctgacgcgca gcctctgctg ctcagcacct tcagcggaga
2221 ccgccgcttc tcccgatttg gtggcgttct gcacttgagt gacctggatg atgatggctt
2281 agatgaaatc atcatggcag cccccctgag gatagcagat gtaacctctg gactgattgg
2341 gggagaagac ggccgagtat atgtatataa tggcaaagag accacccttg gtgacatgac
2401 tggcaaatgc aaatcatgga taactccatg tccagaagaa aaggtaagtg aaaaaaaaaa
2461 aaaaaaaaaa aa






Figure 6: Human GPI-PLD cDNA clone d3 
1942 bp: 455 a 496 c 502 g 489 t



1 gggctgtaac tctgccatcc ctcagcataa tttgggggta tgatttcact atcctaattg
61 cctgtcctaa gtgatcttac ttgctgatag gacctaatgt tttattttat tgtttagcac
121 ttctaaaaac tcatttcctt tacacaagtc caatactttg gacaggaaac agtagctttg
181 ttgattatgc tacgtgtctt tactgtctat aatgattctt ttatttcagg attccatgtc
241 ctttatctac aaggctttgg aaaggaacat aaggacaatg ttcataggtg gctctcagtt
301 gtcacaaaag cacgtctcca gccccttagc atcttacttc ttgtcatttc cttatgcgag
361 gcttggctgg gcaatgacct cagctgacct caaccaggat gggcacggtg acctcgtggt
421 gggcgcacca ggctacagcc gccccggcca catccacatc gggcgcgtgt acctcatcta
481 cggcaatgac ctgggcctgc cacctgttga cctggacctg gacaaggagg cccacaggat
541 ccttgaaggc ttccagccct caggtcggtt tggctcggcc ttggctgtgt tggactttaa
601 cgtggacggc gtgcctgacc tggccgtggg agctccctcg gtgggctccg agcagctcac
661 ctacaaaggt gccgtgtatg tctactttgg ttccaaacaa ggaggaatgt cttcttcccc
721 taacatcacc atttcttgcc aggacatcta ctgtaacttg ggctggactc tcttggctgc
781 agatgtgaat ggagacagtg aacccgatct ggtcatcggc tccccttttg caccaggtgg
841 agggaagcag aagggaattg tggctgcgtt ttattctggc cccagcctga gcgacaaaga
901 aaaactgaac gtggaggcag ccaactggac ggtgagaggc gaggaagact tctcctggtt
961 tggatattcc cttcacggtg tcactgtgga caacagaacc ttgctgttgg ttgggagccc
1021 gacctggaag aatgccagca ggctgggcca tttgttacac atccgagatg agaaaaagag
1081 ccttgggagg gtgtatggct acttcccacc aaacggccaa agctggttta ccatttctgg
1141 agacaaggca atggggaaac tgggtacttc cctttccagt ggccacgtac tgatgaatgg
1201 gactctgaaa caagtgctgc tggttggagc ccctacgtac gatgacgtgt ctaaggtggc
1261 attcctgacc gtgaccctac accaaggcgg agccactcgc atgtacgcac tcatatctga
1321 cgcgcagcct ctgctgctca gcaccttcag cggagaccgc cgcttctccc gatttggtgg
1381 cgttctgcac ttgagtgacc tggatgatga tggcttagat gaaatcatca tggcagcccc
1441 cctgaggata gcagatgtaa cctctggact gattggggga gaagacggcc gagtatatgt
1501 atataatggc aaagagacca cccttggtga catgactggc aaatgcaaat catggataac
1561 tccatgtcca gaagaaaagg cccaatatgt attgatttct cctgaagcca gctcaaggtt
1621 tgggagctcc ctcatcaccg tgaggtccaa ggcaaagaac caagtcgtca ttgctgctgg
1681 aaggagttct ttgggagccc gactctccgg ggcacttcac gtctatagcc ttggctcaga
1741 ttgaagattt cactgcattt ccccactctg cccacctctc tcatgctgaa tcacatccat
1801 ggtgagcatt ttgatggaca aagtggcaca tccagtggag cggtggtaga tcctgataga
1861 catggggctc ctgggagtag agagacacac taacagccac accctctgga aatctgatac
1921 agtaaatata tgactgcacc ag



Figure 7: Alignment of GPI-PLD protein sequences



database MSAFRLWPGLLIMLGSLCHRGSPCGLSTHVEIGHRALEFLQLHNGRVNYRELLLEHQDAY 60
d3
b2       MSAFRLWPGLLIMLGSLCHRGSPCGLSTHIEIGHRALEFLQLHNGRVNYRELLLEHQDAY 60
a1       MSAFRLWPGLLIMLGSLCHRGSPCGLSTHIEIGHRALEFLQLHNGRVNYRELLLEHQDAY 60

database QAGIVFPDCFYPSICKGGKFHDVSESTHWTPFLNASVHYIRENYPLPWEKDTEKLVAFLF 120
d3
b2       QAGIVFPDCFYPSICKGGKFHDVSESTHWTPFLNASVHYIRENYPLPWEKDTEKLVAFLF 120
a1       QAGIVFPDCFYPSICKGGKFHDVSESTHWTPFLNASVHYIRENYPLPWEKDTEKLVAFLF 120

database GITSHMAADVSWHSLGLEQGFLRTMGAIDFHGSYSEAHSAGDFGGDVLSQFEFNFNYLAR 180
d3
b2       GITSHMAADVSWHSLGLEQGFLRTMGAIDFHGSYSEAHSAGDFGGDVLSQFEFNFNYLAR 180
a1       GITSHMAADVSWHSLGLEQGFLRTMGAIDFHGSYSEAHSAGDFGGDVLSQFEFNFNYLAR 180

database RWYVPVKDLLGIYEKLYGRKVITENVIVDCSHIQFLEMYGEMLAVSKLYPTYSTKSPFLV 240
d3
b2       RWYVPVKDLLGIYEKLYGRKVITENVIVDCSHIQFLEMYGEMLAVSKLYPTYSTKSPFLV 240
a1       RWYVPVKDLLGIYEKLYGRKVITENVIVDCSHIQFLEMYGEMLAVSKLYPTYSTKSPFLV 240

database EQFQEYFLGGLDDMAFWSTNIYHLTIFMLENGTSDCNLPENPLFIACGGQQNHTQGSKMQ 300
d3
b2       EQFQEYFLGGLDDMAFWSTNIYHLTSFMLENGTSDCNLPENPLFIACGGQQNHTQGSKMQ 300
a1       EQFQEYFLGGLDDMAFWSTNIYHLTSFMLENGTSDCNLPENPLFIACGGQQNHTQGSKMQ 300

database KNDFHRNLTTSLTESVDRNINYTERGVFFSVNSWTPDSMSFIYKALERNIRTMFIGGSQL 360
d3                                     MILLFQDSMSFIYKALERNIRTMFIGGSQL 30
b2       KNDFHRNLTTSLTESVDRNINYTERGVFFSVNSWTPDSMSFIYKALERNIRTMFIGGSQL 360
a1       KNDFHRNLTTSLTESVDRNINYTERGVFFSVNSWTPDSMSFIYKALERNIRTMFIGGSQL 360

database SQKHVSSPLASYFLSFPYARLGWAMTSADLNQDGHGDLVVGAPGYSRPGHIHIGRVYLIY 420
d3       SQKHVSSPLASYFLSFPYARLGWAMTSADLNQDGHGDLVVGAPGYSRPGHIHIGRVYLIY 90
b2       SQKHVSSPLASYFLSFPYARLGWAMTSADLNQDGHGDLVVGAPGYSRPGHIHIGRVYLIY 420
a1       SQKHVSSPLASYFLSFPYARLGWAMTSADLNQDGHGDLVVGAPGYSRPGHIHIGRVYLIY 420

database GNDLGLPPVDLDLDKEAHRILEGFQPSGRFGSALAVLDFNVDGVPDLAVGAPSVGSEQLT 480
d3       GNDLGLPPVDLDLDKEAHRILEGFQPSGRFGSALAVLDFNVDGVPDLAVGAPSVGSEQLT 150
b2       GNDLGLPPVDLDLDKEAHRILEGFQPSGRFGSALAVLDFNVDGVPDLAVGAPSVGSEQLT 480
a1       GNDLGLPPVDLDLDKEAHRILEGFQPSGRFGSALAVLDFNVDGVPDLAVGAPSVGSEQLT 480

database YKGAVYVYFGSKQGGMSSSPNITISCQDIYCNLGWTLLAADVNGDSEPDLVIGSPFAPGG 540
d3       YKGAVYVYFGSKQGGMSSSPNITISCQDIYCNLGWTLLAADVNGDSEPDLVIGSPFAPGG 210
b2       YKGAVYVYFGSKQGGMSSSPNITISCQDIYCNLGWTLLAADVNGDSEPDLVIGSPFAPGG 540
a1       YKGAVYVYFGSKQGGMSSSPNITISCQDIYCNLGWTLLAADVNGDSEPDLVIGSPFAPGG 540

database GKQKGIVAAFYSGPSLSDKEKLNVEAANWTVRGEEDFSWFGYSLHGVTVDNRTLLLVGSP 600
d3       GKQKGIVAAFYSGPSLSDKEKLNVEAANWTVRGEEDFSWFGYSLHGVTVDNRTLLLVGSP 270
b2       GKQKGIVAAFYSGPSLSDKEKLNVEAANWTVRGEEDFSWFGYSLHGVTVDNRTLLLVGSP 600
a1       GKQKGIVAAFYSGPSLSDKEKLNVEAANWTVRGEEDFSWFGYSLHGVTVDNRTLLLVGSP 600

database TWKNASRLGHLLHIRDEKKSLGRVYGYFPPNGQSWFTISGDKAMGKLGTSLSSGHVLMNG 660
d3       TWKNASRLGHLLHIRDEKKSLGRVYGYFPPNGQSWFTISGDKAMGKLGTSLSSGHVLMNG 330
b2       TWKNASRLGHLLHIRDEKKSLGRVYGYFPPNGQSWFTISGDKAMGKLGTSLSSGHVLMNG 660
a1       TWKNASRLGHLLHIRDEKKSLGRVYGYFPPNGQSWFTISGDKAMGKLGTSLSSGHVLMNG 660

database TLKQVLLVGAPTYDDVSKVAFLTVTLHQGGATRMYALISDAQPLLLSTFSGDRRFSRFGG 720
d3       TLKQVLLVGAPTYDDVSKVAFLTVTLHQGGATRMYALISDAQPLLLSTFSGDRRFSRFGG 390
b2       TLKQVLLVGAPTYDDVSKVAFLTVTLHQGGATRMYALISDAQPLLLSTFSGDRRFSRFGG 720
a1       TLKQVLLVGAPTYDDVSKVAFLTVTLHQGGATRMYALISDAQPLLLSTFSGDRRFSRFGG 720






7-2 



database 

d3 

b2 

al 

database 

d3 

b2 

al 



VLHLSDLDDDGLDEIIIWAPI^I^DVTSGLIGGEDGRVYV^ 
VLHLSDLDDDG^DEIIMAAPLRIADVTSGLIGGEDGR^ 

PCPEEKAQYVLISPEASSRFGSSLITVRSKAKNQVVIAAGRSSLGJ^LSGALHVYSLGSD 

pcpIeSS^Sspeassrfgsslitvrskaknqwiaagrssl^lsg^ 



780 
450 
780 
780 

840 
510 
795 
840 



Database 840 aa
d3 510 aa
b2 795 aa
a1 840 aa



8.1 



Figure 8: Alignment of human GPI-PLD nucleic acid sequences 

1: pancreatic-form: cDNA sequence from GenBank database (L11702) 

2: cDNA clone A1

3: cDNA clone B2 

4: cDNA clone D3 



1 GTGACCTGCTTAGAGAGAAGCGGTGGGTCTGCACCTGGATTTTGGAGTCCCAGTGCTGCT 60
1 GTCTGCACCTGGATTTTGGAGTCCCAGTGCTGCT 34



1 ATGTCTGCT 9
61 GCAGCTCTGAGCATTCCCACGTCACCAGAGAAGCCGGTGGGCAATGAGAGCATGTCTGCT 120
35 GCAGCTCTGAGCATTCCCACGTCACCAGAGAAGCCGGTGGGCAATGAGAGCATGTCTGCT 94



10 TTCAGGTTGTGGCCTGGCCTGCTGATCATGTTGGGTTCTCTCTGCCATAGAGGTTCACCG 69
121 TTCAGGTTGTGGCCTGGCCTGCTGATCATGTTGGGTTCTCTCTGCCATAGAGGTTCACCG 180
95 TTCAGGTTGTGGCCTGGCCTGCTGATCATGTTGGGTTCTCTCTGCCATAGAGGTTCACCG 154



70 TGTGGCCTTTCAACACACGTAGAAATAGGACACAGAGCTCTGGAGTTTCTTCAGCTTCAC 129
181 TGTGGCCTTTCAACACACATAGAAATAGGACACAGAGCTCTGGAGTTTCTTCAGCTTCAC 240
155 TGTGGCCTTTCAACACACATAGAAATAGGACACAGAGCTCTGGAGTTTCTTCAGCTTCAC 214



130 AAT GGGC GT GTTAACT ACAGAGAGCT GTTACT AGAACACCAGGAT GC GT AT CAGGCT GGA 189 
241 AAT GGGCGT GTT AACTACAGAGAGCT GTTACTAGAACAC CAGGAT GC GT AT CAGGCT GGA 300 
215 AAT GGGC GT GTTAACT ACAGAGAGCT GTT ACT AGAACAC CAGGAT GCGTAT CAGGCT GGA 274 



190 AT CGT GTTT CCT GATT GTTTTT AC C CT AGCAT CT GCAAAGGAGGAAAATT C CAT GAT GT G 249 
301 ATCGTGTTTCCTGATTGTTTTTACCCTAGCATCTGCAAAGGAGGAAAATTCCATGATGTG 360 
27 5 AT CGT GTTT C CT GATT GTTTTT ACCCT AGCAT CT GCAAAGGAGGAAAATT CCAT GAT GTG 334 



250 T CT GAGAGCACT CACT GGACT CC GTTTCTT AAT GCAAGCGTT CATTAT AT C CGAGAGAAC 309 
361 T CT GAGAGCACTCACT GGACT C C GTT T CTTAATGCAAGC GTT CATTAT AT C CGAGAGAAC 420 
335 T CT GAGAGCACT CACT GGACT CCGTTT CTT AAT GCAAGCGTT CATTAT AT C CGAGAGAAC 394 



310 TATCCCCTTCCCTGGGAGAAGGACACAGAGAAACTGGTAGCTTTCTTGTTTGGAATTACT 369 
421 TATCCCCTT CCCT GGGAGAAGGACACAGAGAAACT GGTAGCTTTCTTGTTT GGAATTACT 480 
395 T AT CC C CTT C C CT GGGAGAAGGACACAGAGAAACT GGTAGCTTT CTT GT T T GGAATTACT 454 



37 0 T CT CACAT GGC GGCAGAT GT CAGCT GGCAT AGT CT GGGC CTT GAACAAGGATT C CTT AGG 429 
481 T CT CACAT GGC GGCAGAT GT CAGCT GGCAT AGT CT GGGC CTT GAACAAGGATT C CTTAGG 540 
455 TCTCACATGGCGGCAGATGTCAGCTGGCATAGTCTGGGCCTTGAACAAGGATTCCTTAGG 514



430 AC CAT GGGAGCT AT T GATTTT CAC GGCT CCT ATT CAGAGGCT CATT CGGCT GGT GATTTT 489 
541 ACCATGGGAGCTATT GATTTT CAC GGCT CCTATT CAGAGGCT CATT CGGCT GGT GATTTT 600 
515 ACCATGGGAGCTATT GATTTT CAC GGCT CCTATT CAGAGGCT CATT CGGCT GGT GATTTT 574 






490 GGAGGAGAT GT GTT GAGC CAGTTT GAATTTAATTTT AATTACCTTGCAC GAC GCT GGT AT 549 
601 GGAGGAGAT GT GTT GAGC CAGTTT GAATTTAATTTTAATTACCTTGCAC GAC GCT GGT AT 660 
575 GGAGGAGATGTGTTGAGCCAGTTTGAATTTAATTTTAATTACCTTGCACGACGCTGGTAT 634 



550 GT GCCAGT CAAAGAT CTACT GGGAATTT AT GAGAAACT GTATGGTCGAAAAGT CAT CACC 609 
661 GT GC CAGT CAAAGAT CTACT GGGAATTT AT GAGAAACT GTAT GGT CGAAAAGT CAT CACC 720 
635 GT GCCAGT CAAAGAT CTACT GGGAATTTAT GAGAAACT GTATGGTCGAAAAGT CAT CAC C 694 



610 GAAAAT GT AAT CGTT GATT GTT CACATAT CCAGTT CTTAGAAAT GTAT GGT GAGAT GCTA 669 
721 GAAAAT GT AAT C GTT GATT GTT CACATAT CCAGTT CTTAGAAAT GTAT GGT GAGAT GCTA 780 
695 GAAAAT GT AAT C GTT GATT GTT CACATATC CAGTT CTTAGAAAT GTAT GGT GAGAT GCTA 754 



670 GCTGTTTCCAAGTTATATCCCACTTACTCTACAAAGTCCCCGTTTTTGGTGGAACAATTC 729 
7 81 GCT GTTT CCAAGTTATAT C CCACTTACT CTACAAAGT C CC CGTTTTTGGT GGAACAATT C 840 
755 GCTGTTTCCAAGTTATATCCCACTTACTCTACAAAGTCCCCGTTTTTGGTGGAACAATTC 814 



730 CAAGAGTATTTT CTTGGAGGACT GGATGATAT GGCATTTT GGTC CACTAAT ATTTACCAT 7 8 9 
841 CAAGAGTATTTT CTT GGAGGACT GGAT GAT AT GGCATTTTGGT CCACTAAT ATTTACCAT 900 
815 CAAGAGTATTTT CTT GGAGGACT GGAT GAT AT GGCATTTT GGT C CACTAAT ATTTACCAT 874 
GGGCT GTAAC 10 



7 90 CTAACAAT CTT CAT GTT GGAGAAT GGGAC CAGT GACT GCAAC CT GC CT GAGAACCCT CT G 849 

901 CTAACAAGCTT CAT GTT GGAGAAT GGGAC CAGT GACTGCAACCT GCCT GAGAACC CT CT G 960 

875 CTAACAAGCTT CAT GTT GGAGAAT GGGAC CAGT GACT GCAACCT GC CT GAGAACC CT CT G 934 

1 1 T CT GC CATCCCT CAGCAT AATTT GGGGGTAT GATTT CACTAT CCTAATT GC CT GT CCTAA 7 0 



850 TT C ATTGCAT GT GGC GGC CAGCAAAACCACACCCAGGGCT CAAAAAT GCAGAAAAAT GAT 909 

961 TT CATT GGAT GT GGC GGC CAGCAAAAC CACACCCAGGGCTCAAAAAT GCAGAAAAATGAT 102 0 

935 TT CATT GGAT GT GGC GGC CAGCAAAAC CACAC CCAGGGCT CAAAAAT GCAGAAAAAT GAT 994 

7 1 GT GAT CTTACTT GCT GAT AGGAC CTAAT GTTTTATTTTATT GTT T AGGACTT CTAAAAAC 130 

910 TTT CACAGAAATTT GACTACAT CC CTAACT GAAAGT GTT GACAGGAATATAAACTATACT 969 

1021 TTT CACAGAAATTT GACTACAT CCCTAACT GAAAGT GTT GACAGGAATATAAACTATACT 108 0 

995 TTT CACAGAAATTT GACTACAT CCCTAACT GAAAGT GTT GACAGGAATATAAACTATACT 1054 

131 T CATTT CCTTT ACACAAGT CC7\AT ACTTT GGACAGGAAACAGTAGCTTT GTT GATTAT GC 180 



970 GAAAGAGGAGT GTT CTTTAGT GTAAATT C CT GGACC CC GGAT T C CAT GT CCTTT AT CTAC 1029 

1081 GAAAGAGGAGTGTTCTTTAGTGTAAATTCCTGGACCCCGGATTCCATGTCCTTTATCTAC 1140 

1055 GAAAGAGGAGT GTT CTTTAGT GTAAATT CCT GGACC CCGGATTC CAT GTCCTTTAT CTAC 1114 

181 TACGT GT CTTT ACT GT CT ATAAT GATT CTTTT ATTT CAGGATT C CAT GT CCTTT AT CTAC 240 



1030 AAGGCTTTGGAAAGGAACATAAGGACAAT GTT CATAGGT GGCT CT CAGTT GT CACAAAAG 1089 

1141 AAGGCTTT GGAAAGGAACATAAGGACAAT GTT CATAGGT GGCT CT CAGTT GT CACAAAAG 1200 

1115 AAGGCTTTGGAAAGGAACATAAGGACAATGTTCATAGGTGGCTCTCAGTTGTCAC/iLAAAG 1174 

241 AAGGCTTTGGAAAGGAACATAAGGACAAT GTT CATAGGT GGCT CT CAGTT GT CACAAAAG 300 



1090 CACGTCTCCAGCCCCTTAGCATCTTACTTCTTGTCATTTCCTTATGCGAGGCTTGGCTGG 1149 

1201 CACGTCTCCAGCCCCTTAGCATCTTACTTCTTGTCATTTCCTTATGCGAGGCTTGGCTGG 1260 

1175 CACGTCTCCAGCCCCTTAGCAT CTTACTT CTT GTCATTTCCTTATGCGAGGCTT GGCT GG 1234 

301 CACGTCTCCAGCCCCTTAGCATCTTACTTCTTGTCATTTCCTTATGCGAGGCTTGGCTGG 360 






115 o ^^crc,^c^c^^c = ^^^^ [HI 



361 



1269 



1235 GCAATGAU^i^ -. CTCARCCAGGATGGGC ACGGTGACCTCGTGGTGGGCGCACCA 420 

-CACATCCACATCGGGCGCGTGTACCTCATCTACGGCAATGAC 
CAC AT C CACATC GGGC GC GT GT ACCT CAT CT ACGGCAAT GAC 
C CAC AT C CACATC GGGC GC GT GT ACCT CAT CT ACGGCAAT GAC 
C CACAT C CACAT C GGGC GC GT GT AC CT CAT CT ACGGCAAT GAC 

T GAC CT GGAC CT GGAC AAGGAGGCC CACAGGAT CCT T GAAGGC 
TGACCTGGACCTGGACAAGGAGGCCCACAGGATCCTTGAAGGC 

T GAC CT GGAC CT GGAC AAGGAGGC C CACAGGAT CCT T GAAGGC 
TGACCTGGACCTGGACAAGGAGGCCCACAGGATCCTTGAAGGC 

iGTTTGGCTCGGCCTTGGCTGTGTTGGACTTTAACGTGGACGGC 
GTTTGGCTCGGCCTTGGCTGTGTTGGACTTTAACGTGGACGGC 

M1 ^^SSSSSSSSS^^^^SSSSSSSS^ 



1509 



1630 AAGGGAATTGTGGCTGCGTTTTATTCTGGCCCCAGCCTGAGCGACAAAGAAAAACTGAAC 1689 

llll ^ATC^AATTGTGGCTGCGTTTTATTCTGGCCCCAGCCTGAGCGACAAAGAAAAACTGAAC 1800 

VlVk ^gSg^TTGTGGCTGCGTTTTATTCTGGCCCCAGCCTGAGCGAC^GAAAAACTGAAC 1774 

III ^g^ttgtggctgcgttttattctggcccc^gcct^^ 900 



J??5 SSXIS^^ 1834 



1750 CTTCACGGTGTCACTGTGGACAACAGAACCTTGCTGTTGGTTGGGAGCCCGACCTGGAAG 1809 
lltl CTTCaStGTCACTGTGGAC^C^GAACCT^^^ 1894 

961 cttScggtgtcactgtg^c^c^gaaccttgctgttggttgggagcccgacct^ 1020 







1869 
1980 
1954 



1080 



1929 
2040 



a ssas-aaaaasssjB. I 

2049 




iJil ATGGGGAAACTGGGTACTTCCCTT' 

(TACGATGACGTGTCTAAGGTGGCATTCCTGACC 



2169 



1380 



2229 
2340 
2314 
1440 



2110 cnaicBBBaxncvc-ZS^S^^ZZ^^™ 1115 



1500 
2349 



1560 



2409 
2520 



1561 ^gWggcccaatatgtattgatttctcctgaagcca^ 



l 1680 



162 





2529 
2640 





==-==^^ 1320 

1921 TGACTGCACCAGAAAAAAAAAAAAAAAAflAAR 

rTTCCTCGCTCCCTTTGCATCTAAACCTTTCTTCTTTCCCfiACTTATTGCCTGTA 2629 



2330 0^00^3^00™^^ 



2889 



289 o aggtoooto^™^^^ 2949 



2950 CTTTTTTAAAAAAA" 



GAATTGTTCAGCCATAAAGAAAGAAl 



CAAGAT CAT GC CCTTT GCAGG 



3009 



301 o gao^t---^^^ 306? 



30,0 AOAOTOO*™™™^^ 






3130 AAACAACACACAC T GGGGC CT AT GAGAGGGC GGAAGGT GGGAGGAGGGAGAGAT CAGGAA 3189 



3190 AAAT AAC T AAT GGAT AC T T AGGGT GAT GAAAT AAT C T GT GT AAC AAAC C C C CAT GACACA 3249 



3250 C CTTT AT GTAT GTAACAAAC CAGCACTT C CTGC GCAT GTACCCCT GAACTTAAAAGTTAA 3309 



3310 AAAAAAGT T GAAC T T AAAAAT AAC AGAT T GGC C CAT GC CAAT CAAAGT AT AAT AGAAAGC 3369 



3370 ATAGTATAC 3378 







